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CHAPTER  1 


INTRODUCTION 

Recently  the  Generalized  Hessenberg  Representation  (GHR)  was 
introduced  as  a  particular  representation  of  a  linear  time-invariant  control 
system  [1,2].  This  representation  was  obtained  by  a  constructive  algorithm 
called  chained  aggregation  [1,2].  Only  the  elementary  properties  of  the 
GHR  and  chained  aggregation  were  known  at  this  time.  It  is  the  purpose  of 
this  thesis  to  present  an  indepth  study  of  the  fundamental  properties  of 
the  GHR  and  chained  aggregation.  The  GHR  has  been  used  as  model  reduction 
technique  and  as  the  basis  of  a  control  design  scheme.  By  describing  the 
fundamental  structure  of  the  GHR,  insight  is  gained  into  loth  of  these 
methodologies  by  both  a  simplification  of  the  presentation  and  an  extension 
of  the  previous  results.  However,  this  investigation  has  wider  implications. 
It  turns  out  that  the  GHR  is  connected  to  several  basic  linear  system 
properties  which  recur  throughout  the  literature.  Thus  the  GHR  is 
emerging  as  a  common  framework  for  the  investigation  of  many  linear  system 
problems.  It  is  also  the  purpose  of  this  thesis  to  lay  the  groundwork  for 
future  investigations. 

Perhaps  the  most  important  recent  contribution  to  linear  systems  has 
been  the  introduction  of  geometric  techniques  [3].  While  not  detracting 
from  the  importance  of  this  contribution,  it  should  be  noted  that  these 
methods  are  complementary  to  the  older  matrix  methods  of  linear  system 
theory  just  as  a  matrix  can  be  an  array  of  numbers  or  an  abstract  operator. 
Perhaps  the  greatest  understanding  of  linear  system  will  come  when  these 
two  approachs  are  fully  merged.  It  is  the  general  approach  of  this  thesis 


to  make  explicit  use  of  both  of  these  techniques  for  an  indepth  study  of 
the  GHR. 

The  original  presentation  of  the  GHR  was  in  a  completely  matrix 
format  [1-2].  The  starting  point  and  most  fundamental  result  (for  this 
work)  is  a  geometric  interpretation  of  the  GHR  and  chained  aggregation. 

This  turns  out  to  be  a  sequence  of  subspaces  {X^},  called  here  1-th 
unobservability  subspaces.  These  subspaces  are  well  known  [4-5]  in  the 
literature,  however,  they  are  usually  introduced  to  identify  a  particular 
subspace,  XQ,  which  turns  out  to  be  the  unobservable  subspace.  The  other 
subspaces  do  not  seem  to  have  been  exploited. 

Having  this  twofold  interpretation  of  GHR,  we  are  able  to  extend 
the  understanding  of  the  GHR  in  three  directions.  First,  we  introduce 
a  metric  on  the  subspaces  {X^}.  While  this  is  easy  enough  to  do  in  an 
abstract  setting  [6],  it  is  the  GHR  which  provides  the  quantities  to 
estimate  relevant  distances.  Secondly,  we  study  the  subspaces  {X^}  under 
the  action  of  the  input.  By  modifying  chained  aggregation,  the  GHR 
identifies  the  interaction  of  the  input  and  output  and  so  provides  a 
natural  vehicle  to  study  the  closed-loop  version  of  {/^},  {X^}.  Thirdly, 
we  study  the  implicit  system  decomposition  induced  by  the  subspaces  {X^} 
and  their  closed  loop  relatives.  By  representing  the  system  in  an  explicit 
basis,  we  are  able  to  identify  how  physical  subsystems  relate  to  the  more 
abstract  information  subsystems  identified  in  the  GHR. 

Perhaps  the  major  contribution  of  this  work  is  the  unification 
of  these  ideas  into  a  single  framework,  the  GHR,  so  that  their  interaction 
can  be  evaluated  in  a  single  context.  There  are  a  number  of  benefits 
which  follow  from  this  unification,  many  of  which  have  not  been  explored 
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yet.  In  order  to  show  the  usefulness  and  flexibility  of  this  approach,  we 
discuss  model  reduction  and  control  design  using  the  GHR. 

By  combining  the  system  decomposition  with  the  topological 
characterization  of  {X^},  we  obtain  a  unified  approach  to  several  standard 
model  reduction  techniques  [7-11].  The  GHR  was  originally  Introduced  as  a 
reduced  order  modeling  technique  [1],  and  we  are  able  to  deepen  our  under¬ 
standing  of  its  role  in  model  reduction.  If  the  system  decomposition  is 
combined  with  the  closed  loop  subspaces  {X^},  we  obtain  a  control  design 
procedure  called  the  Three  Control  Component  Design  (TCCD) .  Originally 
introduced  in  [2]  in  a  matrix  format,  the  geometrical  interpretation  obtained 
here  not  only  simplifies  the  presentation,  but  extends  those  results  by 
removing  some  of  the  original  restrictions  and  applying  the  procedure  to 
other  problems  such  as  noninteraction  problems,  nonlinear  systems,  and 
dynamic  compensators.  We  feel  that  even  more  Insight  will  be  gained  by 
combining  these  ideas  with  the  topological  characterizations  of  {X^}.  A 
few  preliminary  results  in  this  direction  are  discussed. 

While  combining  these  ideas  is  important,  some  of  the  results 
are  of  interest  in  themselves.  In  particular,  the  topological  characteri¬ 
zation  of  the  {Xi>  seems  to  be  the  first  result  in  this  direction.  A 
related  approach  was  used  in  [12]  to  measure  the  distance  betveen  two 
particular  subspaces,  but  it  is  not  clear  that  this  could  be  generalized 
beyond  this  special  situation.  The  result  of  this  analysis  is  a  measure  of 
the  degree  of  observability.  The  approach  here  is  different  from  past 
approaches  [13-19 J  and  the  results  are  slightly  different.  These  relation¬ 
ships  are  discussed  in  detail  below. 


In  discussing  che  behavior  of  Che  subspaces  under  feedback, 

we  identify  a  maximal  set  {£*}.  These  subspaces  turn  out  to  be  well  known 
[20-22]  with  a  matrix  version  of  them  appearing  earliest  as  Silverman's 
Structure  Algorithm  [21].  A  completely  abstract  characterization  is  given 
by  Wonham  [3]  and  Baslle  and  Marro  [22].  However,  we  are  able  to  identify 
other  sets  of  closed  loop  subspaces  {JE^}  which  have  Important  implication 
for  compensator  design.  These  characterizations  appear  to  be  new.  The 
connection  between  {£^}  and  {jC^}  is  also  new. 

The  algorithms  developed  in  the  analysis  of  these  problems 
present  themselves  for  numerical  computations.  Concurrent  with  this  work 
there  has  appeared  a  numerical  analysis  of  chained  aggregation  [22,23]  and 
the  algorithm  to  compute  {£*}  [24].  These  papers  were  presented  from  a 
computational  point  of  view  which  differs  greatly  from  the  approach  here. 

We  note  that  the  numerical  success  apparently  rests  on  the  use  of  orthogonal 
transformations,  of  which  we  make  theoretical  use  here.  Hence,  the  proce¬ 
dure  here,  particularly  the  design  procedures,  show  great  promise  for 
numerical  implementation.  This  work  is  underway. 

This  thesis  is  organized  as  follows.  The  GHR  and  chained  aggrega¬ 
tion  are  briefly  reviewed  in  Chapter  2  and  the  main  results  are  presented  in 
Chapters  3-5.  These  results  discuss,  respectively,  the  open  loop  geometry, 
including  the  topological  characterizations,  the  closed  loop  geometry,  and 
system  decomposition.  Chapters  6-8  use  these  results  in  discussion  well- 
known  problems  such  as  model  reduction  and  compensator  design.  This  shows 
the  usefulness  of  the  GKR  framework  as  well  as  identifying  problems  for 
future  study.  The  conclusions  are  in  Chapter  9. 
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Notation.  All  matrices  and  maps  shall  be  denoted  by  capital 
Roman  letters  and  vector  spaces  by  capital  script  letters.  If  kiX+y  then 
the  null  space  of  A  is  denoted  by  fl[A]  and  the  range  space  of  A  by  A[A]. 

The  dimension  of  a  vector  space  is  denoted  by  d(«).  By 


(1.1) 


we  mean  the  vector  space  generated  by  the  given  vector  as  the  nonzero 
elements  vary  over  all  of  the  reals.  If  the  subvector  x  in  (1.1)  is 
replaced  by  a  matrix  X,  then  (1.1)  means  the  vector  space  spanned  by  the 
columns  of  that  matrix. 

The  set  of  eigenvalues  of  a  matrix  A  are  denoted  by  X (A) .  An 


element  of  this  set  is  denoted  by  X^(A).  The  largest  and  smallest  eigen¬ 
values  of  A  are  denoted  X(A),  _X (A) ,  respectively.  The  set  of  singular 
values  of  A  are  denoted  by  a (A)  with  other  notation  like  the  eigenvalue 
notation.  Recall  that  if  k-.X+y  where  X  is  an  n-dimensional  vector  space 
equipped  with  the  two-norm,  then  lAl  *a(A).  In  this  thesis,  we  shall  use 
only  two  norms.  If  two  vector  spaces  X  and  y  are  orthogonal  to  each  other 
write  y.  The  orthogonal  complement  of  a  subspace  X  is  Z  ^  • 

The  trace  of  A  is  written  tr  A. 

The  expectation  of  a  random  variable  is  written  E{-}. 

Define 


<AlB>i  -  «[B  +  AB  +  .-U  1], 


(1.2) 


Then  (AjR)  »<A|B>  is  the  controllable  space  of  the  pair  (A,B). 


PRELIMINARIES 


2.1.  The  Model 

In  this  thesis  we  will  be  concerned  mainly  with  continuous  linear 
time-invariant  systems  of  the  form 

x(t)  ■  Ax(t)  +  Bu(t),  x(0),  t >  0  (C.l) 

y(t)  -  Cx(t)  (C.2) 

where  xeRa,  u6R°,  y€Rn,  and  (A,B,C)  are  appropriately  dimensioned  constant 
matrices.  Denote  the  state,  input  and  output  spaces  byZ  *Rn,t(-Rm,  and 
7*  Rr,  respectively.  Then  we  can  think  of  (A,B,C)  as  representing  maps. 

In  fact,  many  of  our  results  are  derived  from  the  algebraic  and  geometric 
properties  of  (A,B,C)  and,  therefore,  apply  equally  well  to  discrete  systems 
of  the  form  (C.1)-(C.2).  Usually  we  shall  not  state  the  results  for  discrete 
systems  except  in  discussion  certain  subspaces  in  Chapters  3  and  4.  These 
subspaces  have  very  nice  interpretations  in  the  discrete  case  which  do  not 
exist  for  continuous  systems. 

Most  of  our  results  also  generalize  to  systems  in  which  (C.2)  is 
replaced  by  1 

y(t)  -  Cx(t)  +  Du(t).  (C.2') 

For  the  most  part,  these  extensions  are  straightforward.  A  special  section 
is  added  where  necessary. 

Without  loss  of  generality,  we  will  always  assume  B  has  full 
column  rank  and  C  has  full  row  rank.  We  make  no  special  assumptions 


concerning  the  observability  or  controllability  of  (C.1)-(C.2) .  In  fact, 
these  concepts  are  so  fundamental  we  rarely  mention  them  explicitly.  In 
most  cases  it  is  obvious  where  these  assumptions  spply. 


2.2.  The  GHR 

The  basis  of  this  thesis  is  a  representation  of  (C.1)-(C.2),  called 
the  Generalized  Hcssenberg  Representation  (GHR)  [  1  ] .  A  system  can  be  trans¬ 
formed  into  a  GHR  by  chained  aggregation  [  1-2  ].  This  consists  of  con¬ 
structing  a  finite  sequence  of  state  apace  transformations  as  follows.  Let 
T^  be  an  nxn  nonsingular  matrix  such  that 

CT1  -  [Cx  0]  (2.2.1) 

i  where  has  full  column  rank  r^.  (Since  we  have  assumed  C  to  have  full  row 

rank,  is  a  square  nonsingular  matrix  and  r^-r.  We  shall  use  this  fact 
without  explicitly  stating  it.)  Interpret  T^  as  a  state  space  transformation 

I 


(2.2.2) 


and  apply  (2.2.2)  to  (C.1)-(C.2)  to  obtain 


(2.2.3) 


For  discrete  systems,  think  reachability. 


This  completes  Che  first  step  of  chained  aggregation.  If  A^»0  or 
d(x^)*0,  then  the  algorithm  terminates  and  the  system  is  in  GHR. 

If  neither  of  these  conditions  hold,  then  the  second  step  (second 
transformation)  is  this.  *  Let  and  be  r^xr^  and  (n-r^)x(n-r^)  nonsingular 
matrices  such  that 


S2A12T2 


(2.2.4) 


where  la  a  r^xr^  nonsingular  matrix.  Define  the  state  space  trans¬ 
formation 


(2.2.5) 


Then  (2.2.3)  becomes 


ill 

c®  * 

r_n 

r  ■ 

y 

Fn  Fu  :  Fi2 :  0 

y 

Gi 

;1 

y 

m 

fu  Fu •:  0  ;  0 

-1 

y 

+ 

h 

*  ^  * 

'  i 

y 

A21  A21  :  A22  :  A23 

y 

B2 

.1 

•  -y 

X 

r  J 

_A31  A31  :  A32  :  A33 . 

X 

rJ 

kJ 

u 


or 


(2.2.6) 


i 


1 


.2 

y 


+ 


u 


(2.2.7) 


This  completes  the  second  step  of  chained  aggregation.  The  algorithm 

2 

terminates  if  A^^O  or  d(xr)*0.  If  these  conditions  do  not  hold,  the 
third  transformation  is  defined  analogous  to  the  second  by  replacing  in 
(2.2.4)  with  A^*  'The  algorithm  then  proceeds  until  on  the  ith  step 

l 

Ai  i+l"°  or  Ic  ia  «asy  to  see  the  £<n.  Then  each  transformation 

as  described  in  (2.2.5)  will  be  called  one  step  of  chained  aggregation. 

After  i  steps  of  chained  aggregation,  (C.1)-(C.2)  has  the  form 


(2.2.8) 


y  *  [H^  0  . 0]x^. 


.i 

x 


.1 

y 


.2 

y 


.i 

y 


•  i 
x 

r 


11 


21 


i,l 


i+1,1 


12 


22 


23 


i ,  i+1 


A 


i+1, i+1 


x1  + 


i+1 


Note  that  the  transformations  are  chosen  such  that  7?  [F  .  ] *  0  for 

J 

j *  1, . . . ,i-l.  In  (2.2.8)  we  identify  two  interconnected  subsystems;  the 
1-th  aggregate  given  by 


1 

*<- 

_ l 

"Fu 

F12  ° 

1 

1 

o 

_ 1 

0 

.2 

y 

F21 

F22  F23 

o  : 

*1  * 

* 

■  • 

•  • 

•o 

yi  + 

u  + 

‘•Ft-i,i 

0 

.1 

Lyi. 

F 

r 

r. 

I 

L  i,l 

L 

iJ 

L  J 

*  F^y  +  G^u  +  E^w 
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and  the  1-th  residual  given  by 


.1  1  . 
x  **  A...  .,,x  + 
r  1+1, 1+1  r 


^Ai+l,l . Al+l,l*yi  +  Bi+lu> 


w 


F 

1,1+1 


(2.2.10) 


If  in  (2.2.8)  i+^"0  then  chained  aggregation  terminates  and  we  say  that 

the  system  (2.2.8)  aggregates ♦ 


I 


i 
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CHAPTER  3 

GEOMETRY  OF  THE  GHR 

3.1.  Introduc  t ion 

In  this  chapter  we  present  the  first  set  of  fundamental  results 
concerning  the  geometrical  interpretation  of  the  GHR.  In  particular,  in 
this  chapter  we  will  be  concerned  with  open  loop  geometry  of  the  GHR.  That 
is,  throughout  this  chapter  we  will  assume  that  u(t) *  0.  We  identify  a 
nested  set  of  subspaces  that  are  intimately  related  to  the  observability 
structure  of  the  system  (not  surprisingly) .  The  second  section  is  devoted 
to  an  algebraic  characterization  of  these  subspaces  and  their  relation  to 
some  system  invariants. 

In  the  third  section  we  introduce  a  topological  characterization 
of  these  subspaces.  This  is  based  on  some  results  by  Stewart  [26]  so  we 
review  them  there.  Basically,  this  analysis  allows  us  to  relax  the  subspace 
containment  condition  which  permeates  geometric  system  theory.  For  example, 
a  system  is  unobservable  if  7[[C]  contains  an  A-invariant  subspace.  Our 
characterization  allows  us  to  say  that  the  ?][C]  Is  near  an  A-invariant 
subspace.  It  is  shown  that  this  characterization  is  related  to  the  canonical 
angles  between  subspaces  giving  it  a  very  geometrical  flavor. 

This  analysis  allows  us  to  measure  the  degree  of  observability. 

This  is  a  long-standing  problem  [13-19]  and  we  relate  our  measure,  called 
near  unobservability,  to  several  other  well  known  measures  in  Section  3.4. 

It  turns  out  that  nearly  unobservable  systems  are  characterized  by  1)  a 
certain  geometrical  relation  between  observability  subspaces  and  the  A- 
invariant  subspaces  and/or  2)  separated  eigenvalues.  In  general  these  two 


properties  interact  in  a  complex  way  to  determine  the  structure  of  the 
operator  A.  Separated  eigenvalues  are  a  characteristic  of  singularly 
perturbed  systems  [27]  and  there  is  an  interesting  connection  between 
explicitly  perturbed  systems  and  near  unobservability  [22].  We  examine 
this  in  Section  3.5.  A  simple  example  is  included  in  Section  3.6  to 
illustrate  the  basic  ideas  involved. 


3.2.  Observability 

In  this  chapter  we  will  study  systems  of  the  form 

x  ■  Ax  (3.2.1a) 

y  -  Cx,  (3.2.1b) 

i.e.,  the  observability  structure  of  (C.1)-(C.2).  The  following  definition 
is  a  well-known  characterization  of  observability. 

Definition  3.2.1  [4]:  The  vector  is  an  element  of  £y  the  j-th  unob¬ 

servable  subspace  if  S-x(O)  implies  y(0)  *  y(0)  *•••  =  y^-1^  (0)  *  0.  By 

definition,  £  *  Z.  □ 

o 

If  the  system  representation  contains  an  input,  it  is  assumed  to  be  identically 
zero  for  all  t. 

The  discrete  version  of  Definition  2.1  gives  a  more  intuitive 
characterization  of  the  subspaces  j C  .  We  list  it  here  for  completeness. 

For  the  discrete  version  of  the  development  here,  see  [28  ] . 
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Definition  3.2.1  (Discrete):'  The  vector  £€£  is  an  element  of  £ if  £»x(0) 


and  y(0)  -  •••  -y(j-l)  -0.  £  «£ 


The  following  proposition  gives  a  well-known  characterization  of 


the  subspaces  £^  for  (3*2.1). 


j-1  , 

Proposition  3.2.2  f  5l:  £.•  O  77CCA . 

- -  j 

Proof :  From  (3.2.1b) 


y(0)  -  Cx(0)  -  0 


(3.2.2) 


by  Definition  3.2.1.  Hence,  £^  [C] .  Similarly 


y(0)  ■  Cx(0)  ■  CAx(O)  *  0. 


(3.2.3) 


Since  Definition  3.2.1  requires  that  both  (3.2.2)  and  (3.2.3)  hold,  we  have 


JC2  »7?[Cjnfl[CA]. 

Induction  completes  the  proof. 

By  elementary  properties  we  have  the  following  corollary. 
Corollary  3.2.3  [20]: 


(3.2.4) 


1)  is  a  subspace. 


2>  XjCXJ+1. 


3)  There  exists  an  l<n  such  that  JC  .  Cl  JC  is  proper  for  j  <  l  and  £  *£ 

J  ^  J 


for  all  j  >  l. 


4)  £j  are  invariant  to  a  change  of  basis  in  the  state  and  output  spaces. 


5)  £ n  is  the  standard  unobservable  subspace. 


Proof:  The  first  two  statements  are  obvious.  The  third  statement  follows 


from  the  fact  that  if  £»£..,  then  CAJ£.*0.  Hence,  A.C.C£.  or  £  .  is  A- 

j  J+l  J  J  J  J 


invariant.  This  implies  £^=£^  for  all  i>  j.  Therefore,  the  sequence 


must  be  strictly  decreasing  until  equality  holds.  We  have  l <  n  by  the 


-i 


*w 
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finite  dimensionality  of  z  .  The  fourth  statement  is  easily  proved  by  sub¬ 
stituting  x *  Tx  in  Proposition  3.2.2.  Let  O  be  the  unobservable  subspace 
If  x(0)€£Jl,  then  y^(0)-0  for  all  k£  0.  Thus  £ .  Also  if  x(0)  €$ 
then  y(t)*0  Vt  implying  £  C£^.  Hence,  5)  follows.  o 

By  4)  of  Corollary  3.2.3,  the  subspaces  £ ^  must  be  connected  to  a 
set  of  invariants  for  the  pair  (A,C).  Indeed,  define  the  numbers 


ri  -  d(£*)  -  dCC^i)  i*l ,...,£ 


(3.2.5a) 


Y  -  r. 


(3.2.5b) 


<»i  -  max{a  :  rQ  >  j}  j-1 . y. 


(3.2.5c) 


Then  the  list  (a^,  i€r}  is  the  well-known  observability  indices.  For  a 
proof  of  this  and  a  further  discussion  of  these  indices  see  Wonham  [3  ]. 
The  number  l  is  sometimes  known  as  the  observability  index  [3  ]. 

The  unobservability  subspaces  are  intimately  related  to  the  GHR. 
After  i  steps  of  chained  aggregation  let  the  system  be  represented  as  in 
(2.2.8).  Then  we  have 


Theorem  3.2.4  [  28  ] 


Proof;  The  proof  is  simply  a  specific  basis  version  of  the  proof  of 
Proposition  3.2.2.  The  result  is  trivial  for  i*l  because  of  the  representation 
of  C  in  (2.2.8).  Suppose  the  theorem  holds  for  j«l,...,i.  By  Corollary  3.2.3, 
we  know  that  If  and  x(0)£?l[F^  i+13»  then  yi(0)^o. 

Since  7([F  1  ]  /  0  for  j»l,...,i  (by  construction  of  the  GHR) ,  it  follows 

that 

y(i+1‘j)(0)  -  Fi+1_j(1+2_jy(H)(0)  #0  j-l,...,i.  (3.2.6) 

Thus,  by  Definition  3.2.1,  x(0)  £  £.  .  ,  or7][F  ,  On  the  other 

1+  1  i  ,  1+1  1+1 

hand,  i£  x(0)  €  7J  [  i+^3,  the  same  argument  shows  y^(0)*0  for  J*1 . i, 

and  f1+lCfl[rlil+1]  l»plylngX1+l-»!(Fiji+1l. 

The  GHR  is  a  particular  basis  which  explicitly  displays  the  sub¬ 
spaces  ,  j*l,...,£,  one  subspace  being  identified  at  each  step  of  chained 
aggregation.  In  particular,  it  will  display  the  unobservable  subspace.  This 
has  lead  some  authors  [23,24]  to  suggest  chained  aggregation  with  orthogonal 
transformations  be  used  to  compute  observability  subspaces. 

The  last  remark  made  implicit  use  of  the  fact  that  the  transforma¬ 
tions  in  chained  aggregation  are  not  uniquely  defined.  In  fact,  some  of 
the  work  below  is  aimed  at  using  this  nonuniqueness  in  various  ways.  What 
then  is  unique  about  the  GHR?  The  following  corollary  is  so  easily  proved, 
the  proof  is  omitted. 

Corollary  3.2.5  [29]:  The  numbers  r  ^  ,  defined  in  (3.2.5a),  are  the  dimensions 
of  the  diagonal  blocks  F.  .,  j*l,...,i.  = 


The  subspaces  supply  Che  foundation  of  Che  geometrical  analysis 


Chat  follows. 


3.3.  Near  Unobservabilit} 


3.3.1.  Norms 


In  Che  last  section,  we  introduced  an  algebraic  characterization 
of  the  subspaces  £ .  It  would  clearly  be  useful  if  a  topological  charac¬ 
terization  could  also  be  provided.  The  GHR  provides  an  approach  to  this 
problem.  If  in  (2.2.8)  i+^»0,  then  £^  ■  and  £^  is  an  A-invariant 

unobservable  subspace.  Thus,  if  IF  .  . I  is  "small,"  £  should  be  "near" 

i.  ,  l*rl  i 


an  A-invariant  subspace 


This  idea  was  recently  suggested  in  [30]  but 


no  system  interpretations  were  given. 

To  make  these  qualitative  judgments,  we  need  to  introduce  norms 
into  the  system  representation  (3.2.1).  To  motivate  our  approach  assume  an 
L?-norm  on  the  space  of  output  functions  (y(t),  0<t<T}.  Then 


T  T  T 

8yll2  =  /  yT(t)y(t)dt  *  xT(0)[/  eA  tCTCeAtdt ]x(0) 


=  xA (0)K(T)x(0) 


(3.3.1) 


where  K(T)  is  the  well-known  [31]  observability  gramian.  Since  K(T)  is  a 
positive  semidefinite  matrix,  (3.3.1)  provides  a  direct  relationship  between 
the  initial  states  and  the  corresponding  output  functions.  If  tlx(0)II  =  1  and 
il  vS  is  "small"  then  x(0)  is  said  to  be  "weakly"  observable  [10,17,32].  More 
generally,  we  can  characterize  the  degree  of  observability  via  the  properties 
of  K(T)  [10,13,16,17,32],  However,  we  will  not  pursue  this  approach  directly 
because  of  the  difficulties  of  computing  K(T) . 


17 


The  above  analysis  shows  there  is  a  direct  relationship  between 
the  natural  norm  on  the  space  of  output  functions  and  the  natural  Euclidean 
or  2-norm  in  the  state  space  Rn.  Our  approach  is  to  analyze  the  subspace 
geometry  with  operations  which  preserve  the  two  natural  norms  and  the 
relationship  between  them.  Thus  we  restrict  the  transformations  in  chained 
aggregation  to  be  orthogonal  state  space  transformations.  These  operations 
yield  systems  algebraically  equivalent  to  the  original  system  so  that  all 
the  usual  algebraic  Information  is  preserved.  All  norms  on  Rn  are  the  two- 
norm 

Ixl  ■  +(x*^x)^  (3.3.2) 

and  its  subordinate  operator  norm 

IAI  -  sup  (3.3.3) 

Both  of  these  norms  are  invariant  to  orthogonal  transformations  on  Rn  as  are 
the  essential  properties  of  K(T) . 

To  summarize,  we  shall  measure  "near"  unobservability  by  measuring 
the  "nearness"  of  JC^  to  an  A-invariant  subspace  If.  The  measure  will  be 
given  in  terms  of  certain  properties  of  an  orthogonal  matrix  which  trans¬ 
forms  into^ .  These  transformations  preserve  the  natural  norms  on  the 
output  function  space,  the  state  space,  and  K(T) .  However,  K(T)  need  not 
be  computed.  The  exact  relationship  to  K(T)  will  be  investigated  in 
Section  3.4. 

Implicit  in  the  selection  of  the  norms  above  is  the  issue  of 
scaling.  Since  we  are  not  using  limiting  arguments,  the  relative  magnitudes 
of  various  quantities  are  meaningful.  For  instance,  by  properly  scaling  a 
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basis  and  then  choosing  the  basis-dependent  norm  (3.3.2),  we  can  say  the 
initial  conditions  are  evenly  distributed  over  the  unit  ball.  This  raises 
a  difficult  and  unsolved  problem  which  we  will  not  address  here.  It  is 
only  our  purpose  to  set  up  a  framework  in  which  this  problem  is  easily 
incorporated  (if  there  is  one).  In  any  case,  it  should  be  noted  that  this 
question  plays  a  fundamental  role  in  the  use  of  the  theory  below. 


3.3.2.  Computations 

To  provide  a  precise  characterization  of  the  distance  between 
certain  subspaces,  we  will  use  results  by  Stewart  [26 1  -  There,  in  the 
context  of  numerical  analysis,  Stewart  characterizes  the  nearest  A- 
invariant  subspace  by  describing  the  properties  of  a  rotation  needed  to 
carry  the  given  subspace  into  an  A- invariant  subspace.  To  apply  those 
results  to  our  problems  we  first  suppose  that  (3.2.1)  is  represented,  after 
one  step  of  chained  aggregation  (using  an  orthogonal  transformation)  as 


(3.3.4) 


In  this  representation,  the  natural  basis  for  Rn  yields  an  orthonormal  basis 
for  £^  and  £"^.  That  is 


L 


0 

I 

n-r  J 


(3.3.5) 


is  a  basis  for  R 


n 


1* 


such  that  the  first  r  columns  span  £^  and  the  second  n-r 
Now  an  algebraic  characterization  of  Stewart's  notion  of 


columns  span  £ 


nearness  [ 26  J  is  expressed  as 


V  -  [V  V]  -  LP 
c 


(3.3.6) 


where  fl[V] «  V  is  an  A- Invariant  subspace  and  P  is  an  orthogonal  matrix 
given  by 


[i  ;rr 


T 

(I+P  P) 


(3.3.7) 


where  the  square  roots  are  the  unique  positive  definite  square  roots  of  the 
indicated  matrices.  Thus  P  rotates  to  V* 

A  useful  geometrical  interpretation  enters  through  the  use  of 
canonical  angles. 

Definition  3.3.1  [26,34] :  Let  l(  and  V  be  subspaces  of  Rn  with  orthonormal 

T 

bases  U  and  V,  respectively.  Let  o±  be  the  singular  values  of  U  V.  Then 
the  canonical  angles  between  V.  and  V  are  the  numbers 

.  -1 

=  cos  a^.  o 

To  apply  this  definition  to  and  V,  we  first  obtain  orthonormal  bases  from 
(3.3.5)— (3.3.7) .  Thus  we  have 


T  T  ”  '/i 

L^V  -  (I+PF) 


(3.3.8) 


Let  P  have  singular  values  a^.  Then  the  canonical  angles  between  and*y 


are  given  by 


-1  2  ~Vz 

9^  *  cos  (1  +  a^) 


(3.3.9) 


It  follows  that 


a .  *  tan  9  . . 
i  i 


(3.3.10) 


By  choosing  P  in  (3.3.7),  P  rotates  to  a  subspace  V  and  the  canonical 


angles  are  directly  related  to  o(P).  For  future  use,  let 


0  -  diag(6^) 


cos  0  -  diagCcos  0^) 


and  similarly  for  sin  0  and  tan0.  Then 


I  Pi  ■  I  tan  01 


IP(I+PTP)~^I  -  I  sin  01 


I (I+PTP)  ^  |  «  I  sin  01 


(3.3.11) 


(3.3.12) 


See  [26  ]  for  a  more  complete  geometric  discussion. 

Thus  far  we  have  translated  the  problem  of  finding  subspaces  near 
£ ^  to  choosing  a  matrix  P  in  (3.3.7).  If  the  rotated  subspace  Y  is  A- 
invariant,  then^'*’-1-  A^(cy).  In  the  matrix  form  (3.3.6) 

VTAV  -  0.  (3.3.13) 

c 

Substituting  (3.3.7)  in  (3.3.2)  and  (3.3.6)  and  using  the  representation  of 
A  in  (3.3.4),  (3.3.13)  becomes 

(I  +  PPT)  /2  (A1P-PA4  +  A2-PA3P)(I  +  PTP)”1/l»  0.  (3.3.14) 

Thus,  if  V  is  an  A-invariant  subspace  then  P  in  (3.3.7)  must  be  a  root  of  a 
Riccati  equation  (3.3.14),  which  we  write  as 

AjP  -  PA4  *  PA3P  -  A2>  (3.3.15) 


The  exact  solution  of  (3.3.15)  is  known  [35],  but  it  requires  knowledge  of 
the  eigenvectors  and  eigenvalues  of  A.  Instead  we  will  use  a  bound  on  P 
given  in  terms  of  the  coefficients  of  (3.3.15).  Stewart  [36]  has  done  a 


careful  analysis  which  goes  like  this.  First  note  that 


T(P)-A]P-PA4  (3.3.16) 

is  a  linear  operator  in  P.  Hence,  if  A^  is  snail  enough  the  quadratic  term 
in  (3.3.15)  can  be  neglected  and  we  obtain  a  bound  from  the  approximate 
linear  equations 

IPI  <IT_1I*»A2I.  (3.3.17) 


The  exact  statement  is  as  follows : 
Theorem  3.3.2  [26,36]:  Let  6  — 1|  T  *fl 


n-|AJ.  Then  if 


(T.l) 


there  exists  a  matrix  P  such  that 


IPI  <  .  (T.2)  c 

We  note  for  future  use  that  if  A^  ■  0  ((3.3.15)  is  a  Lyapunov  equation)  (T.l) 
is  always  satisfied  so  that  bound  (T.2)  always  applies. 

From  the  preceding  analysis  we  immediately  have  the  following 

theorem: 

Theorem  3.3.3  [26]:  Assume  that  Theorem  3.3.2  holds.  Then  there  exists 
a  P  satisfying  (T.2)  such  thatV  *fl[V]  is  an  A-invariant  subspace. 

Furthermore, 


1(A)  *  A (A ' ) u  A (A ! ) 

i  4 

where 

1(a|)  =■  x[(i  +  ppT)/j  (a1-pa3)(i  +  ppt)  ,/j  ] 

X  (A’ )  -  X  [  (I  +  PTP)’ 1/3  (a4  +  a3p)  (x  +  p  tp)  1/3  ] . 


Theorems  3.3.2  and  3.3.3  give  an  estimate  of  the  nearness  of  ^  to  f. 
The  results  on  eigenvalues  in  Theorem  3.3.3  are  obtained  by  simple  algebraic 
manipulation.  It  conveys  the  intuitive  idea  that  if  I  PI  is  small,  the 
eigenvalues  of  and  approximate  the  eigenvalues  of  A.  For  a  detailed 
discussion  of  this  point,  see  Stewart  [26}. 


3.3.3.  Subspace  topology 

Thus  far  we  have  characterized  the  relation  between  two  subspaces 
and  If  in  terms  of  a  rotation.  These  ideas  are  related  to  the  following 
subspace  topology. 

Definition  3.3.4  [  6 ]:  Let  “U  and  V  be  subspaces  of  Rn.  The  gap  between 

and  V  is  the  number 

x  CU,1r)  m  max{  sup  inflv-ul,  sup  inflv-ul}.  o 

I  ul  «1  vg  V  I  v|  -1  u6 

u£  U  v€  IT 

Since  we  are  using  the  two-norm,  the  gap  function  x  is  a  metric  which  also 
preserves  dimension.  See  [  6 ]  or  [26]  for  further  properties.  The  gap 
function  is  related  to  canonical  angles  as  follows.  Let  the  canonical 
angles  between  "U  and  V  be  9^. 

Proposition  3.3.5  [26]'-  tCU.V)  «  Jsin  0B  .  r. 

Hence,  if  the  canonical  angles  between  two  subspaces  are  small,  they  are 
close  in  the  gap  topology. 

With  this  background,  we  are  ready  to  introduce  and  discuss  near 

unobservability.  Let  z  >  0  be  given. 

o 

Definition  3.3.6  ( 33 ] :  If  for  any  i <  l 


xU:.  .“JO  <  e 


for  some  A- invariant  subspace  V,  then  we  say  the  subspace  is  nearly 
unobservable.  a 

Selection  of  the  number  cq  simply  means  that  in  the  context  of  a 
particular  problem,  we  are  judging  the  subspaces  and  7"  to  be  close  in  the 
gap  topology.  What  is  small  involves  a  number  of  issues  Including  scaling. 
However,  as  we  discuss  near  unobservability  in  the  context  of  other  observ¬ 
ability  measures,  as  we  will  do  below,  various  criteria  for  judging 
"smallness"  will  emerge. 

At  first  glance.  Definition  3.3.6  appears  to  be  hrrd  to  use  because 
of  the  abstract  nature  of  all  quantities  involved.  However,  from  Proposition 
3.3.5  and  (3.3.12),  we  have  that 

r(£t,V  -  Isin  01  <  I  tan  01  -  I  PI .  (3.3.18) 

So  it  is  enough  to  estimate  I  PI.  In  fact,  the  discussion  leading  up  to 
Theorems  3.3.2  And  3.3.3  set  up  the  framework  for  estimating  t  (£  ^ , V) .  The 
calculations  are  given  in  Theorem  3.3.2.  The  estimates  are  obtained  from 
submatrices  read  off  from  the  GHR.  Thus  by  imposing  some  more  structure  on 
the  GHR,  we  are  able  not  only  to  explicitly  Identify  the  subspaces  £^,  but 
also  estimate  the  distance  to  A-invariant  subspaces  directly  from  the  GHR. 


3.3.4.  Ir terpretations 

Assuming  that  (T.l)  is  satisfied,  B PH  (measuring  the  nearness  of 
£^  toV)  depends  on  two  quantities,  y  and  5  which  we  will  discuss  in  turn. 
First,  we  will  see  that  y  is  a  measure  of  the  deviation  of  £^  from  Y.  If 
£^  is  an  A-invariant  subspace,  there  exists  a  matrix  B  such  that 


(3.3.20) 


This  shows  that  if  the  eigenvalues  of  and  are  poorly  separated, 
then  the  bound  (T.2)  is  not  small.  The  converse  is  not  necessarily  true, 
because  the  inequality  (3.3.24)  may  not  be  tight.  The  exact  relationship 
between  A(T)  and  5  is  not  well  understood,  it  being  similar  to  the  relation¬ 
ship  between  eigenvalues  and  singular  values  of  a  real  matrix.  However,  if 
and  A^  are  diagonalizable,  we  have  the  following  result.  Let  be  a 


(3.3.25) 


complete  system  of  eigenvectors  for  end  define 

k(S±)  -  IS±I  -IS”1! . 

Proposition  3.3.7  [261* 

6  >  - — -  ...  .  D  ala  |  X(T)  | . 

*c(S^)<(S^)/mln(r,n-r) 

In  the  special  case  when  and  are  Hermetlan,  is  a  unitary  matrix. 
Hence  kCS^)*!  and  5  is  directly  related  to  eigenvalue  separation.  Following 
Stewart  [261  we  will  use  the  notation  S  ■  sepCA^.A^) . 

In  this  section  we  have  discussed  an  explicit  measure  of  the 
distance  from  jC^  to  an  A-invarlant  subspace  V  using  the  GHR.  Because  of 
the  lower  block  Hessenberg  structure  it  is  trivial  to  extend  this  analysis  to 
any  subspace 


3.4.  Weak  Observability 

3.4,1.  Observability  gramian 

In  this  section  we  will  relate  near  unobservability  to  more  well 
known  notions.  We  begin  with  the  widely  used  measures  of  the  degree  of 
observability  via  the  observability  gramian  [10,13,16,17,32].  We  will  restrict 
ourselves  to  systems  (3.3.4)  which  are  stable  and  assume  T * 00  in  (3.3.1). 

In  this  case  K(»)  *  K  is  the  solution  to  the  Lyapunov  equation 

KA  +  ATK  =*  -CTC.  (3.4.1) 

The  measures  on  K  may  be  interpreted  as  follows.  Let  the  singular  value 
decomposition  for  K  be 


U!  “= 

KJL0  hi 


1  -  K 


(3.4.2) 


where  are  real  diagonal  matrices  with  elements  o^. 

Definition  3.4.1:  If 

i /a^  »  Ja^ 

for  all  j,j',  then  we  say  the  subspace  *  fl[U2].is  weakly  observable.  o 

By  introducing  the  topology  of  the  last  section,  we  could  extend  this  notion 
by  including  all  subspaces  in  the  neighborhood  ofl^*  Also,  other  measures 
on  Z^  and  Z^  can  be  used. 

Assume  that  the  system  is  represented  as  in  (3.3.4)  and  that  K 
has  been  computed  from  (3.4.1)  with  respect  to  the  basis  in  (3.3.4).  Consider 
V ■  P  in  (3.3.7)  to  be  a  state  transformation 


"I-  pPl.Pll  P 12  ~|  H 

Cr .  .V  _P21  P22.  ,x 


(3.4.3) 


Furthermore,  assume  that  is  nearly  unobservable  and  that  P  has  been  chosen 
in  (3.4.3)  to  carry  X ^  into  an  A-invariant  subspace  IT  .  Note  that  from 
(3.3.7)  and  (3.3.12),  I  P^H  **  J  cos  01  and  5  P^  *  I  sin  0J  .  Since  X^  is 
nearly  unobservable,  II P^  is  small. 

Compute 


-T  -  ^1  ° 

P  AP  =  =  A 

Js  v 


(3.i.4a) 


CP  '  !C1P11  ClPn!  ’  E 


(3. a. 4b) 


-T  —  T-T  -  -T- 

P  KPA  +  A  P  KP  *  -C  C 


(3.4.4c) 


27 


where 

-  K.  (3.4.5) 


Then  from  (3.4.4c)  we  obtain 


Kl^l  +  ^1K1  “ 

-pLcicipu 

(3.4.6a) 

K-A,  +  A-  - 
2  4  12 

■P11C1C1P21  *  VS 

(3.4.6b) 

K,A.  +  A.K,  - 
3  4  4  3 

T  T 

_P21C1C1P21' 

(3.4.6c) 

What  we  would  like  to  show  is  that  K  is  nearly  a  block  diagonal 
matrix  and  that  the  spectra  is  separated.  This  implies  thaty  is  a  weakly 
observable  subspace  and  so  relates  near  unobservability  to  weak  observability. 
The  approach  is  to  apply  Theorem  3.3.2  to  (3.4.6).  This  will  give  conditions 
under  which  the  spectra  is  separated  and  K  is  almost  block  diagonal. 

Motivated  by  this  observation  we  apply  Theorem  3.3.2  to  (3.4.6c) 

to  obtain 

21 C  l2|sin  01  2 

,  K.l  <  - ? -  (3.4.7) 

3  64 

6^  -  sep(Aj,-A4). 

Let  be  a  complete  eigensystem  for  A^ .  Then  if  we  have  from 

Proposition  3.3.7  that  <5^  is  approximately  twice  the  smallest  eigenvalue  of 
A^ .  So  if  the  eigenvalues  of  A^  are  all  large  and  since  3  sin  Gil  is  small, 
it  follows  that  B  K^B  (=  largest  eigenvalue)  is  also  small. 

The  next  step  is  to  bound  the  lower  eigenvalue  of  to  show  that 
the  spectra  of  and  are  separated.  We  can  do  this  using  the  results  in 


-T  - 
PKP 


K1  *2 
lkI  s 


[38].  To  this  end  let  o(X)  denote  the  smallest  singular  value  of  X.  Then 
for  (3.4.6a)  we  have 


£(*!> 


T  -l/a  2 

oXC^I+PP1)  '  ) 
21  A^l 


(3.4.8) 


Thus  if  lA^I  is  small  (implying  small  eigenvalues)  and  (3.3.4)  is  nearly 
unobservable  (implying  o^(P^)  »  a^(cos  Q)  ®*  1) ,  then  £(K^)  will  be  large. 

Taken  together  with  (3.4.6),  this  implies  a  separation  in  the  spectra  of 
and  K^.  It  follows  from  Proposition  3.3.7  that  * sep(K^,K^)  will  be  large. 

Finally,  we  compute  a  bound  for  K^.  Again  we  apply  Theorem  3.3.2 
to  (3.4.6b)  to  obtain 


I K-l  <  ~  [2lA,l  *8  K,l  +  lC.II2isin  01,-llcos  08  ] 

2.  6  J  J  1 

6  ■  sep(A?,-A. ). 

X-+  X  H 


(3.4.9) 


This  bound  contains  some  interesting  information.  For  IIK^II  to  be  small  we 
must  have:  (1)  (3.3.4)  is  nearly  unobservable  (8  sin  08  is  small),  (2)  6 
is  large  (X(A^)  and  X(A^)  are  separated).  (3)  is  small,  and  (4)  BA^II 

is  small.  The  first  two  conditions  entered  into  determining  the  near 
unobservability.  The  third  condition  relied  on  the  fact  that  A^  contained 
large  eigenvalues.  Hence,  the  separation  between  X(A^)  and  X(A^)  must 
occur  in  a  special  way.  Finally,  if  IIA^U  is  small  then  (3.4.4a)  is  almost 
block  diagonal.  This  is  interpreted  as  meaning  that  Tr"  is  near  an  A- 
invariant  subspace  (apply  Theorem  3.3.3!).  This  imposes  additional 
structure  on  the  eigenstructure  of  A  and  represents  the  difference  between 
weak  observability  and  near  unobservability. 


I 


4.2.  Trajectory  Analysis  _ 

Another  way  of  interpreting  near  unobservability  is  to  consider 
the  evolution  of  the  state.  Intuitively,  we  can  think  of  the  state 
evolving  on  two  A-invariant  subspaces,  and  *tf ^ •  Furthermore,  suppose  If  ^ 
is  near  jC^»  TjC].  Thus,  the  projection  of  the  states  in  on  output 
is  small  so  that  they  should  be  "nearly  unobservable." 


We  can  make  these  ideas  precise  by  again  considering  the  trans¬ 
formed  system  in  (3.4.4a) , (3.4.4b) .  Given  an  initial  condition,  the  states 


evolve  as 


A^t 

y(t)  =*  e  1  y (0) 


(3.4.10a) 


A,  t  t  A, (t-T)_  A  (t) 

<  (t)  *  e  4  x' (0)  +  /  e  4  A»e  1  y(0)dr 
r  r  o  J 


Thus,  from  (3.4.4b)  the  output  evolves  as 


(3.4.10b) 


y(t)  -  C^P^U)  +P12xr(t))  . 


(3.4.11) 


From  (3.4.10)  and  (3.4.11)  we  can  get  an  estimate  of  the  contribution  of 


x  (t)  to  the  output  y(t).  First,  let 

- 

Xi«-X<  2  >-  i*1'4' 


(3.4.12) 


Then  it  is  shown  in  [39]  that 


A . t  A.  t 

U  1  1  <  e  lm 


i  =  1,4. 


(3.4.13) 


A,  t  t  A,  (t-r)  A  t 

8x'(t)l  <  e  ‘+m  ax' (0)n  +  HAS  *lly(0)t!  f  e  4m  e  m  dx 

r  j  0 

A  t  n  A  S  •«  y  ( 0 )  I  A  t  C 

<e  II  x  (0) S  4 — p - 7 r  (e  -e  )  . 

r  !’,'lm'\m1 


(3.4.14) 


30 


Combining  (3.4.14)  with  (3.3.10)  we  have 


xi»‘ 


X4mc 


I  y(t)l  <  IC^  [IPUI  -I  y(0)l  e  +  IP12I  [«  lir(0)l 

.  'V-'j®11  ,  V  Xlac  i , , 

‘W1,  'e 


(3.4.15) 


The  estimate  in  (3.3.14)  is  good  for  each  value  of  t  whereas  the  analysis 
by  observability  gramian  is  the  integral  of  the  square  of  (3.4.16). 

This  bound  again  shows  all  the  structure  of  weakly  observable 

systems. 


3.5.  Singular  Perturbations 

It  is  clear  that  near  unobservability  is  closely  related  to  time 
scale  separation.  It  enters  through  the  dependence  of  6  on  eigenvalue 
separation  in  Theorem  3.3.3.  As  an  application  of  the  ideas  in  the  previous 
sections,  we  will  discuss  near  and  weak  observability  in  singularly  perturbed 
systems.  In  this  section  we  will  consider  systems  of  the  form 


(3.5.1a) 


(3.5.1b) 


First,  we  would  like  to  compute  the  asymptotic  eigenspaces  of  the 
slow  and  fast  modes  of  (3.5.1a).  We  use  Theorem  3.3.2  to  see  how  near  the 
eigenspace  of  the  fast  modes  is  to 


From  Proposition  3.3.7  we  have 


sep  (A^  -~)  -  j  sep  (eA^A^  -  j  6{ 


-7D,eXi“xj 


(3.5.3) 


for  some  X^€  X(A^)  and  X^eX(A^).  Then  from  Theorem  3.3.2  (T.l)  becomes 


_  IAJIAJ  2 

1—  <  - - -  •  ~~z  -*•  0  as  e-*>0. 

6  e  6 
e  o 


(3.5.4) 


So  for  small  enough  e,  Theorem  3.3.2  applies  and  P  is  bounded  as 


as  e>0. 

e  °o 


(3.5.5) 


Hence,  the  fast  eigenspace  tends  to  V  (3.5.2)  as  e-»0  by  Theorem  3.3.3. 

A4 

Also  note  that  from  Theorem  3.3.3,  the  fast  eigenvalues  tend  to  X(— )  since 

!!  p||  -  o- 

To  estimate  the  slow  subspace  V  ,  we  chose  P  to  place  a  0  in  the 

s 

(2,1)  block  of  (3.5.1a).  The  necessary  condition  (3.5.4)  still  holds  so  that 
an  estimate  for  |!  P  ||  is 


s  6  6 

o  o 


H  A3  !l  as  t  -*■  0 

!|  a4| 


(3.5.6) 


from  (3.S.3).  From  (3.5.6)  we  might  guess  that 


T  -1 
p  m  A  A 

e  a4  a3 


so  that 


u 


and  the  slow  eigenvalues  are  tending  to  X (Aj-AjA^A-)  from  Theorem  3.3.3. 
(We  are  rotating  into  the  subspace  spanned  by  the  first  r  columns  of  P  in 
(3.3.7).  This  requires  that  Theorem  3.3.3  be  modified  as  X (A)  »  X (A^-A^^) 
u  X^y+P1^)  -) .  This  is  confirmed  in  [27], 


Now  if  X(A  )  and  X(—  A.)  are  both  stable  for  small  enough  e,  can 
o  e  4 

these  results  be  related  to  weak  observability?  In  fact,  we  can  apply  the 
analysis  of  Section  5.4  almost  directly.  Now  C  in  (3.5.1b)  replaces  C  in 
(3.3.4).  Then  C  in  (3.4.4b)  becomes 

C=  [Cj^d+PP1)  /J  +C2P(I+PPT)  1/7  ,  CjPd+P1?)”1^  +C2(I+PTP)  ,/jJ-  (3.5 


This  causes  the  bound  for  SK_S  to  become 
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IK3I  S.  j-  tlC1l2|sin0l2  +  2lC1l -iCjllain  01-lcos  0l+IC2l2lcos  ©I 2] 

4 

-  j-  [IC.IIsin  01 +lCjlcos  0I]2.  (3.5.9) 

4  1  1 

Ic  is  easily  seen  chat  6  as  e+O  so  that  eventually  the  bound  in  (3.5.9) 
becomes  useful.  However,  a  strong  component  of  the  fast  modes  in  the  output 
(I C^l  large)  requires  that  the  fast  modes  be  sufficiently  fast. 

The  lower  bound  on  the  eigenvalues  of  is  only  increased  by  the 
presence  of  the  fast  component  in  the  output. 

From  (3.4.9)  the  bound  on  K2  becomes 


1 K  I  <  ~~  [21 A  _  I  *  I K  I  +  {  [I  C,  ■  2  Icos  01  -I  sin  01 
2  fi14  3  3  1 

+  I C^l I C2I  [I cos  01+1  sin  01]}].  (3.5.10) 

It  can  be  shown  that  behaves  as  as  e  •+•  0 .  It  is  also  easily  seen 

that  as  e-*-0.  In  fact,  it  is  not  hard  to  show  that 


-  I  A”1!  -IA3I 


as  e-*-0. 


(3.5.10) 


From  (3.5.8)  we  know  that  JK^  -*■  0  as  e-*0  so  that  the  first  term  on  the  RHS 

of  (3.5.9)  goes  to  zero  as  e -*■().  Note,  however,  that  the  convergence  is  much 

faster  if  If  i If  f  or  nearly  so.  This  asymptotic  limit  is  a  property  of 
s  c 

(3.5.1)  for  e  *  1. 

The  second  term  in  (3.5.9)  also  goes  to  zero  as  e  0  because 
H  cos  Gil  and  I!  sin  GH  are  bounded  by  1.  However,  the  presence  of  the  fast 
variables  in  the  output  again  slows  down  the  convergence  because  this  term 
contributes  a  constant.  Indeed,  if  IIC^II  is  large,  fast  dynamics  are  again 


indicated  for  the  fast  states  to  be  weakly  observable.  Similar  results  were 
obtained  in  (40]  in  a  closed  loop  context. 

The  analysis  in  the  last  two  sections  clearly  shows  the  connection 
between  near  unobservability  and  time  scale  separation.  Since  the  computa¬ 
tions  do  not  depend  explicitly  on  e,  this  may  be  a  useful  method  for  identi¬ 
fying  time  scales. 


3.6,  A  Simple  Example 

In  this  section  we  will  consider  the  following  second  order 

system 


(3. 6.i: 


which  is  already  a  GHR. 


Then  if  follows  from  Theorem  3.2.4  that 


(3.6.2] 


The  eigenvalues  are  ,\  *a  and  \»=d  with  respective  eigenvectors 


rl‘[° 


(3.6.3a) 


r2  -  d-a 


(3.6.3b) 


To  apply  Theorem  3.3.2,  first  note  that  transformation  P  in  (3.3.7) 


is  given  by 


1  F 


-P  1 


(1+pV1/?  0 

0  (1+P2)~1/2 


cos  9  sin  9 


-sin  9  cos  9 


(3.6.4) 


where  9  is  the  angle  between  the  x2-axis  and  the  second  column  of  P. 
Now  in  this  example  the  quantities  in  Theorem  3  3.2  are  computed  as 


follows: 


6  -  d-a 


Y  *  b 


n  *  0 


(3.6.5) 


-a : 


Here  (T.l)  holds  trivally.  (In  general,  (T.l)  applied  to  a  second  order 
system  is  just  the  condition  that  the  system  have  two  real  roots.)  Now 
compare  x^  in  (3.6.3b)  to  the  second  column  of  P  in  (3.6.4)  to  the  bound  (T.2) 
in  (3.6.5).  This  clearly  shows  the  two  characteristics  of  a  nearly  unobserv¬ 
able  system;  i.e.,  a  system  in  which  r^  is  close  to  The  first  is  the 

geometric  relationship  between  r ^  and  £ ^  which  is  measured  by  jb|. 

This  quantity  is  unrelated  to  the  eigenvalues  and,  in  fact,  can  be  altered 
by  scaling.  The  second  characteristic  is  the  time  scale  separation  which 
is  measured  by  |d-a  |.  As  is  well  known,  this  quantity  is  unrelated  to  the 
spatial  distribution  of  the  eigenvectors  and,  in  fact,  is  invariant  to 
scaling.  Here  enters  the  connection  to  singular  perturbations. 

To  discuss  weak  observability,  consider  the  system 


a  e  y 


b  d  x 


(3.6.6) 


y  -  [1  0] 


where  e  is  small.  If  g*0  then  the  eigenvalues  and  eigenvectors  would  be 


'i  ■  «•  ri 


(3.6.7) 


X2  "  d’  r2 


« 
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By  continuity  arguments,  for  mall  e  we  can  consider  (3.6.7)  to  be 
approximations  to  the  true  quantities. 

Clearly,  the  system  is  nearly  unobservable  for  small  e  since 
r2  is  near  £^.  Note,  however,  that  if  b  is  large  then  r^  is  also  near  £^. 

It  is  in  exactly  this  situation  that  a  nearly  unobservable  system  is 
not  weakly  observable.  This  can  easily  be  seen  by  computing  the  observability 
gramian  from  (3. A. 6).  Using  the  fact  that  *  cos  0  and  P2^  ”  a^n  ® 
we  have 


-  sin20 

-  sin20 

*  2d 

2 

-  cos  0 

2 

-  COS  0 

(3.6.8) 

2Xi 

=  2a 

. b(sin20), 

k_  -  — —  (cos  0  sin  0  +  — r - .)  . 

Z  at  q  a  2 


Using  Theorem  3.3.2  we  can  estimate  p  as 


P  -  — T 
r  a-d 


(3.6.9) 


Next,  doing  the  computation  in  (3.4.4a),  we  have 

v  .  2 

b  -  — +  aP  ~  dP-~  j  £  .  (3.6.10) 

l+p“  1  +  p 

To  review,  we  used  the  transformation  in  (3.6.4)  to  measure  the 
near  unobservability  of  (3.3.6).  The  estimate  of  p  is  given  in  (3.6.9)  which 
we  assume  is  small  because  e  is  small  and  a-d  is  large.  Because  p(*  tan  0)  is 
small,  in  sin  0  ^0  and  cos  3  =1.  Then  from  (3.6.3)  it  follows  that  k  is  small 


small  and  k^  is  large.  This  is  one  of  the  characteristics  of  weakly  observ¬ 
able  systems.  Next  note  that  bs»b  from  (3.6.10)  and  **  d  because  c  is 
small.  Thus,  k£  in  (3.6.8)  will  not  be  small  if  b  is  too  large  and  so  the 
system  will  not  be  weakly  observable.  In  this  case,  (3.6.7)  shows  that  r^  is 
close  to  r^>  This  is  the  geometric  structure  of  systems  which  are  nearly 
unobservable  but  not  weakly  observable. 
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CHAPTER  4 

CLOSED  LOOP  GEOMETRY 


4.1.  Int  roduc  t Ion 

It  Is  obvious  from  the  GHR  (2.2.8)  that  the  open  loop  unobservable 
subspaces  £ can  be  modified  by  state  feedback  [28].  Indeed  we  shall  base  our 
design  procedures  on  appropriately  modifying  these  subspaces.  In  this 
chapter  we  shall  investigate  the  role  of  the  subspaces  jC^  in  a  closed  loop 
setting.  This  involves  two  kinds  of  subspaces:  (A,B)-invariant  subspaces 
[  3  ]  associated  with  the  state  feedback  law  Section  4.2.  and  (C. A) -invariant 
subspaces  [72]  associated  with  the  observer,  Section  4.3.  Two  applications 
are  included  in  Section  4.4. 


4.2.  (A, B) -Invariant  Subspaces 

4.2.1.  Definitions 

In  this  section  we  will  study  the  geometry  of  discrete  systems  under 
the  action  of  a  state  feedback  law,  i.e. 


x(k+l)  *  Ax(k)  +  Bu(k) 


y (k)  -  Cx(k) 


u(k)  *  Lx(k)  +  v(k) , 


(4.2.1a) 

(4.2.1b) 

(4.2.1c) 


The  reason  for  working  with  discrete  systems  is  that  we  will  obtain  very 
nice  dynamical  interpretations  of  certain  subspaces  which  do  not  exist  for 
continuous  systems.  However,  the  results  hold  for  continuous  systems  and 
there  is  no  loss  of  generality  here. 

With  respect  to  the  closed  loop  system  (A-rBL,B,C),  defined  in 


(4.2.1),  we  have 


r 


re 
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Definition  4.2.1  [28]:  The  vector  £€%  is  an  element  of  the  1-th  L- 


unobservable  subspace,  £y  if  x(0)  ■  C  and  v(0) 


v(j-l) »  0  implies 


y(0) 


•  y(j-l)  -  0.  By  definition,  £^*Z. 


Note  that  are  defined  with  respect  to  a  fixed  L.  For  different  feedback 
laws  we  obtain  different  sets  of  subspaces,  £ Indeed,  if  L = 0  we  obtain 
the  unobservable  subspaces  of  Section  3.3.2.  The  subspaces  clearly 
satisfy  the  same  properties  as  in  Corollary  3.2.3,  l)-4).  Property  5)  is 
modified  as  follows: 

Corollary  4.2.2:  jC^  is  an  (A,B)-invariant  subspace  in7I[C]. 

Proof :  From  the  proof  of  Corollary  3.2.3,  it  follows  that  is  an  (A+BL)- 
invariant  subspace  which  is  exactly  the  definition  of  an  (A.B)-invariant 
subspace.  Also,  £^c7l[c]  since  by  Corollary  3.2.3,  2),  =*  7?[C]  (from 

Definition  4.2.1).  □ 


4.2.2.  Maximal  L-unobservable  subspaces 

We  can  characterize  the  subspaces  X^  by  describing  the  effect  of 
the  control  on  the  subspaces  X^ .  More  precisely,  we  will  characterize  the 
effect  of  the  control  on  the  super-diagonal  blocks  F^.  in  the  GHR, 
because  these  blocks  govern  the  relationship  between  £ ^  and  .  We  will 
first  describe  a  set  of  X^" 's  which  we  show  below  are  in  a  certain  sense 


maximal.  From  properties  of  these  subspaces  we  will  derive  other  sets  of 
L 

j 


XL,s. 


The  procedure  starts  by  applying  one  step  of  chained  aggregation 
to  (4.2.1).  The  result  is 


j 
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[i(w,i.f'u  vu*,+m 

x^(lcfl)  J  [a21  A22J  LB2iJ 


u(k). 


(4.2.2) 


y(k)  -  [H. 


0]x1(k) 


We  can  identify  the  control  affect  on  the  information  structure  of  (4.2.2) 
by  identifying  flfG^  and  TUG^J.  To  this  end,  there  exists  nonsingular 
matrices  and  such  that 


™-[V :] 


(4.2.3) 


where  G^  is  a  nonsingular  matrix. 


We  embed  in  the  n*n  matrix 


0  I 


(4.2.4) 


and  interpret  it  as  a  state  space  transformation 


y  y 

*  T. 

..  f  „  T 


(4.2.5) 


Similarly  we  think  of  as  an  input  space  transformation 


u  *  V, 


where  the  partitioning  in  (4.2.6)  is  compatible  with  (4.2.3).  Applying 
(4.2.5)  and  (4.2.6)  to  (4.2.2)  yields 


2 


y  (k+l) 


x'(k+l) 

r 


;  U<*>j 

[  y(k)  1 

Lx'(k)J  * 


0  [? 
•  •  Lui< 


(4.2.7) 


We  can  further  isolate  the  effect  of  the  input  on  by  defining 
I  0  0  y1  Ty1 

"I 

0  1  0  y2  *  y2 

m  m  ■*  * 


In-mJ  Lxr. 


(4.2.8) 


W1  ‘  “B31®11* 


Applying  this  to  (4.2.7)  we  have 


y1 (k+l) 


F  ’  F 
11  .  12 


y  (k+l)  -  Fu  .  F12 


XX 

r  y(k)]  r  u  (k)  ■ 

+  0  0 

Lx'ooJ  Lu.  (k). 


(4.2.9) 


*r(k+1)J  LA21  !  A22. 


T  y(k)‘ 

i  :.  0] 

1  L*>)J 


y(k)  *  [5 


Equation  (4.2.9)  shows  explicitly  how  the  control  affects  and 
£  .  First  note  that  £^  =  7J[C]  (from  Definition  4.2.1)  for  any  L  since  the 
state  feedback  does  not  affect  F^?  which  defines  £ In  particular,  note 
that  we  can  choose  such  that 


F. .  +  G. _L*  =  0 


(4.2.10) 


I 


I 


since  is  nonsingular. 
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Then 

define 

the 

*L 

L*  ‘ 

11 

12 

L  ■ 

-L21 

L22. 

(4.2.11) 


where  the  unstarred  entries  are  arbitrary.  Using  (4.2.11)  and  (4.2.9)  with 
Definition  4.2.1,  we  have 

£.\  -  spl 


x 

•  r 


(4.2.12) 


Because  of  the  construction  of  (4.2.9),  the  subspaces  in  (4.2.12)  are 
maximal  (with  respect  to  inclusion)  of  the  subspaces  £^  1*1,2  for  all 

possible  L's.  Henceforth,  we  shall  denote  these  maximal  subspaces  by  £*; 

rL*  A  ^ 

i.e.  ,  £  .  -  £ 

x 


J 


If  in  (4.2.9)  chen£*»£*  and  we  are  done.  If  F^^O*  c^en 

we  repeat  the  algorithm  above  on  (A^, , > F^)  C°  identify  £;£•  includes 

one  step  of  chained  aggregation  plus  transformations  (4.2.4),  (4.2.6),  and 
(4.2.8).  We  shall  call  this  algorithm  Modified  Chained  Aggregation  (MCA). 

At  each  step  the  computations  are  embedded  in  appropriately  defined  state  or 
input  transformations  so  that  the  algorithm  produces  equivalent  system 
representations.  As  described  above,  this  algorithm  was  first  presented  in 

[41].  See  also  [28.29,42]. 

* 

r  * 

It  turns  out  that  the  subspaces  £^  are  well  known.  Consider  the 
following  definition. 
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c 


re 


Definition  4.2.3  [4,20]:  The  vector  £€%  is  an  element  of  the  1-th  weakly 

unobservable  subspace,  V ^  if  x(0)  ■  5  implies  that  there  exists  an  input 


sequence  u(0) 

Y  -  X. 


u(j-l)  such  that  y(0) 


-  y(j-l)  -  0.  By  definition. 


Proposition  4.2.4  [28]: 

Proof :  We  will  establish  Proposition  4.2.4  for  j»l,2.  Induction  will 
complete  the  argument.  Prom  Definition  4.2.3  we  have  V  *7][C],  hence  the 
result  holds  for  j«l.  Now  suppose  £e£*.  Define  the  control 


•  ... 

‘5X(0)' 

*  u(0)  * 

m 

-u.(0). 

0 

(4.2.13) 


_  T 


where  and  L ^  satisfy  (4.2.10).  Then  since  straight- 

■  *  clr 
r~  2 


forward  computation  in  (4.2.9)  yields  y[0,l]*0.  Therefore,  £*CY..  On  the 


2 

other  hand,  suppose  £g£*.  Then  either  y(0)  i  0  or  y  (1)^0.  If  the  latter 
holds  y(l)?*0.  In  either  case.  Definition  4.2.3  fails  no  matter  what  control 
is  selected.  It  follows  that  This  gives  the  result.  o 

It  has  been  shown  [20]  that  Y^=Y*  where  Y*  is  the  supremal  (A,B)- 
invariant  subspace  in7[[C]  [  3  ].  (Indeed,  this  result  can  be  proved  using 
the  construction  given  above.)  Thus  Proposition  4.2.4  serves  to  connect  the 
subspaces  £*  to  the  well  known  subspace  Vk . 

The  subspaces  V  have  been  used  [20]  to  connect  several  well- 
known  algorithms  [3,21,25]  and  so  relates  these  algorithms  to  MCA.  Of  these 
algorithms,  [25]  is  closest  to  MCA  in  that  it  uses  only  input  and  state 
space  transformations.  However,  this  algorithm  was  presented  in  a  numerical. 

not  theoretical,  context.  Furthermore,  all  of  these  algorithms  identify  only 

j,  l 

jC^'s.  We  shall  use  MCA  to  identify  other  sets  of  l^'s. 
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4.2.3.  Generic  cases 

When  MCA  terminates,  the  system  (4. 2. la) -(4. 2. lb)  will  always  have 
the  representation 


y(k+l) 


xr(k+l) 


LBGR 


y(k)  *  [H. 


R 

M 

0] 


y(k) 


Lxr(k) 


y(k) 


xr(k) 


B, 


0 

B, 


RJ 


u(k) 

G(k) 


(4.2.14) 


B 


with  the  properties  that  1)  72 ^ G1;L ]  *  0  and  2)  fl[CRJ  C«t[Gi;L]  (so  there  exists 

state  feedback  to  cancel  C^).  We  shall  frequently  refer  to  this  system 
representation  below. 

Not  every  system  displays  all  of  the  structure  in  (4.2.14).  There 
are  three  basic  variations  which  can  be  illustrated  nicely  by  considering  the 
generic  results  of  MCA.  To  do  this  suppose  that  the  first  transformation  of 
MCA  is  selected  as 


'11 
L  0 


'12 

I 


(4.2.15) 


[C11  C12^ 


whera  is  an  rxr  nonsingular  matrix  •  Then  it  follows  that 

Gi  *  CB 


(4.2.16) 


where  is  found  in  (4.2.2).  Genericallv  the  product  CB.  an  rxm  matrix, 
has  full  rank.  There  are  three  cases. 


This  is  always  possible  by  state  permutation. 


Case  1.  m>  r  -  more  inputs  than  outputs. 

Here  CB  has  full  row  rank  and  a  nonzero  nullspace.  Hence,  in 

(4.2.14) 


(4.2.17) 


Furthermore,  flfF^]  CflfG^]  so  that  all  of  F^  can  be  canceled  on  the  first 
step.  This  implies  and  MCA  terminates  here.  Thus  all  of  the  structure 

displayed  in  (4.2.14)  exists  and  is  nonzero.  o 

Case  2.  m**r  -  equal  number  of  inputs  and  outputs. 

In  this  case  CB  is  a  square  nonsingular  matrix.  We  have  a  similar 
structure  to  Case  1  except  that 


MCA  still  terminates  on  the  first  step  and£*  =  £^.  Now  in  the  system  repre¬ 
sentation  (4.2.14),  the  submatrix  B  does  not  exist.  a 

K 

Case  3.  r>m  -  more  outputs  than  inputs. 

Now  CB  does  not  have  full  row  rank.  Condition  (4.2.18)  holds  here, 
but  in  (4.2.9)  F^  Is  not  canceled.  Furthermore,  all  the  control  has  been 
used.  Thus  MCA  continues  as  chained  aggregation  on  the  pair  (A22’F12^  ^see 
(4.2.9)  where  B^  does  not  exist).  Since  this  pair  is  generically  observ¬ 
able,  we  have,  generically,  £*=0.  c 

n 

This  analysis  establishes  the  generic  existence  of  X*  (as  is  well  known 
[  3  ]).  This  will  also  give  us  a  criteria  for  generic  solvability  of  certain 


problems  discussed  below. 


We  would  like  to  describe  how  to  generate  other  sets  of  L- 


unobservable  subspaces,  but  first  we  need  to  establish  other  properties  of 
the  system  representation  produced  by  MCA.  Let  L*  be  any  feedback  matrix 
which  makes  £*  (A+BL*) -invariant.  Let 


(4.2.19) 


Denote  by  fl*  the  supremal  reachability  subspace  in  7][C]. 

Proposition  4.2.5  [  28  ] :  «*  -  <  A+BL^BJ^ 

Proof :  According  to  Wonham  [  3  ,  p.  113]  <1*  is  characterized  as 
fl*“<  A+BL* |  *2^* ^ tf? ( B ]>  .  So  we  need  to  show  V*n<l[B]  ■  We  proceed  by 

induction.  If  4«1,  then£*-V*.  In  this  case  from  the  input  transformation 
in  (4.2.3)  and  (4.2.6),  it  follows  that 

jC*Dfl[B]  -  Bt^.  (4.2.20) 


If  i  >  1  then  we  have  that  Vi  identifies  jC^nBt^.  Since  •C*D<C£+1»  we  have 
that  x£nfl[B]  D  Bl^.  On  the  other  hand,  if  u^t^  then  Bu^£*  by  MCA.  So 
£*H(R[B]  C  Bl^  and  the  result  follows.  *  o 

MCA  explicitly  identifies  a  feedback  matrix  L*  and  the  subspace 
Hence,  it  also  identifies  ft*.  Indeed,  consider  the  first  generic  case 
of  (4.2.14).  We  have 


«*  -<ar!V- 


(4.2.21) 


In  fact,  this  analysis  further  classifies  the  generic  cases.  For  Case  1. 
we  have  ft*»£*  since  (4.2.21)  is  generically  reachable.  In  Cases  2  and  3, 


In  general,  the  pair  (A^.B^  in  (4.2.14)  will  not  be  reachable. 

We  can  give  the  eigenvalues  of  these  unreachable  modes  a  fundamental  inter¬ 
pretation  using  the  following  definition.  Let  A*«A+BL*|jC*  and  denote  by 
A*  the  map  induced  on  £*/&* . 

Definition  4.2.6  [43]:  The  invariant  zeros  of  (4.2.1a)-(4.2.1b)  are  the 

eigenvalues  of  A*.  ° 

Proposition  4.2.7  [28]:  The  invariant  zeros  of  (4.2.14)  are  the  unreachable 

modes  of  the  pair  (A^,BR). 

Proof:  Because  MCA  generated  the  representation  in  (4.2.14),  by  Proposition 
4.2.4 


£*  -  sp 


r  J 


(4.2.22) 


The  feedback  map  L*  is  essentially  determined  by 


CR  +  BGL2  “  0  (4.2.23) 

(see  (4. 2. 10)- (4. 2. 11))  which  shows  that  A*®^.  By  Proposition  4.2.5,  61* 
is  the  reachable  space  of  (A^jB^).  It  is  a  well  known  [3  ]  fact  that  the 
eigenvalues  of  the  unreachable  modes  of  a  pair  (A,B)  are  eigenvalues  of  the 
induced  map  on  the  factor  space  %(mod  <  A | B > ) .  In  this  case  the  factor 
space  is  £*/6l*  and  so  Proposition  4.2.7  follows  from  Definition  4.2.6.  a 
The  invariant  zeros  provide  the  final  classification  of  the 
generic  cases.  Both  Cases  1  and  3  will  have  no  invariant  zeros  while  Case  2 


will  have  n-ra  invariant  zeros. 
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4.2.5.  Ij*  subspaces 

We  are  now  in  a  position  to  describe  other  sets  of  L-unobservable 

subspaces.  It  turns  out  to  be  most  useful  if  we  concentrate  on  possible 

candidates  for  We  proceed  by  selecting  a  subspace  \'CZ  and  then 

computing  a  feedback  matrix  such  that  £^m9(.  We  first  note  that  Xc  £*  since 

n  i 

£*  is  the  supremal  L-unobservability  subspace.  In  fact,  for  all  i  and 

all  L.  Hence,  the  L*-unobservable  sub spaces  £*  represent  an  upper  bound  on 
all  possible  sets  of  subspaces  £^.  Furthermore,  we  want  X  to  be  an  (A,B)- 
invariant  subspace  of  the  original  system. 


From  Propositions  4.2.5  and  4.2.7,  £*  can  be  decomposed  as 


t-* 


<R*®  £  where  £  is  an  A*- invariant  subspace  associated  with  the 


invariant  2eros.  Since  7CC£*  write  X  m  Xr>£  and  X  m  X^it*-  The  possible 

**  Z  Z  2T 

set  of  subspaces  X  is  then  determined  by  possible  sets  of  X  and  X  •  We 
will  discuss  each  in  turn. 

Assume  first  that  fl*  ■  0  in  (4.2.14)  (B  -0) .  Thus,  the  invariant 

R 


zeros  are  the  set  A. 


<V 


Now  X  mK  must  be  (A.B)-invariant  in  7J[  C  ] .  By  inspection  of 
(4.2.14)  it  is  not  possible  to  alter  the  submatrix  AR  by  state  feedback.  It 
follows  that  Xz  must  be  (A^) -invariant .  Furthermore,  if  X  is  to  be  unobserv¬ 
able,  we  must  have  7J[CD]37('  .  But  we  can  alter  the  submatrix  C_  arbitrarily 

by  state  feedback  (since  d?[C„] Cfl[B„]) .  So  we  have  that  X  can  be  selected 

KG  z 

to  be  anv  (A^) -invariant  subspace. 

Now  consider  where  the  containment  is  proper.  To  simplify 

the  analysis,  we  will  assume  that£**fl*,  i.e.,  the  system  is  represented 


as  in  (4.2.14).  It  is  easy  to  see  that  X  must  be  an  (A  , B  )-invariant 

r  R  R 

subspace.  Furthermore,  we  want  X 


to  be  a  closed  loop  unobservable  subspace. 
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If  we  use  feedback  to  replace  C_  by  X  ,  this  Is  equivalent  to  saying  X  C7[[X..]. 

R  i  r  i 

So  we  might  as  well  choose  X^  such  that  X^  is  the  supremal  (A^,BR) -invariant 

subspace  in  7][X^],  i.e.,  jC*  for  the  subsystem  (A^,BR,X^).  Now  computing  such 

a  feedback  along  with  properties  of  the  resulting  subsystems  was  discussed  at 

some  length  above.  All  of  those  results  carry  over  directly  here. 

Finally,  we  note  that  the  two  cases  discussed  above  can  be  combined 

in  a  completely  straightforward  way  to  produce  all  sets  of  subsystems 

Only  note  that  when  fl* j4 0,  the  eigenvectors  of  A*  associated  with  the 

invariant  zeros  are  functions  of  the  feedback  in  A*.  The  possible  sets  of 

are  then  generated  by  first  selecting  X  and  then  X  • 

r  z 

4.2.6.  Direct  feedthrough  term 

MCA  can  be  applied  to  systems  of  the  form 


x(k+l)  **  Ax(k)  +  Bu(k) 
y (k)  -  Cx(k)  +  Du(k) . 


(4.2.24) 


In  fact,  Definition  4.2.1  can  be  used  directly  to  define  the  subspaces  £ 


j 


where  tr;e  closed  loop  system  is  now  (A+BL,B,C+ DL,D)  [20].  All  the  properties 
of  jC^T  go  through. 

To  apply  MCA  to  (4.2.24),  we  first  find  nonsingular  matrices  S  and 

o 


V  such  that 
o 


S  DV 
o  o 


D1  ° 

L  0  0  J 


(4.2.25) 


where  is  a  nonsingular  matrix.  We  interpret  Sq  as  an  output  space 

transformation.  Write 


s  c  - 

o 


k  ' 


BVo  -  [Bj  B2] 


(4.2.26) 


where  the  partitioning  is  compatible  with  (4.2.26).  Now  we  apply  MCA  to  the 
system  (A,B2>C2). 


Using  Definition  4.2.1,  we  have  that 


rj  -  mc2]  D  TIC]. 


(4.2.27) 


If  D«0,  then  JC*“7?[C].  Thus,  we  see  that,  in  general,  the  subspaces  f*  are 
larger  when  D^O.  However,  the  modifications  to  the  theory  are  minor. 


4.3.  (C.A)-Invariant  Subspaces 


The  second  class  of  subspaces  we  will  be  interested  in  is  (C,A)- 
invariant  subspaces.  These  subspaces  will  be  useful  in  constructing  dynamic 
observers.  These  subspaces  are  the  formal  dual  of  (A, B) -invariant  subspaces, 
but  we  shall  work  with  them  directly.  We  start  with  a  definition. 

Definition  4.3.1  [22]:  A  subspace  is  (C , A) -invariant  if  AG^n  7][C ])Ca/. 


Proposition  4.3.2  [22]:  J  is  (C, A) -invariant  if  and  only  if  there  exists  a  K 
such  that  (A+KC)«^C  J .  c 

Proposition  4.3.2  clearly  shows  the  dual  nature  of  (C, A) -invariant  and  (A,B)- 
invariant  subspaces. 

The  GHR  is  a  useful  basis  for  identifying  (C,A)-invariant  subspaces. 
Let  (C.1)-(C.2)  be  represented  after  one  step  of  chained  aggregation  as 


i 
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(4.3.1) 


where  Is  nonsingular.  First  note  that  any  subspace  a/  of  Rn  with  a  basis 
of  the  form 


V  - 


(4.3.2) 


where  X  Is  an  arbitrary  (n-r)xr  matrix  and  is  a  (C, A) -invariant  subspace. 

When  X*0,  this  is  Immediate  from  Proposition  4.3.2,  because  is  nonsingular. 
When  Xj*0,  introduce  the  state  space  transformation 


(4.3.3) 


Substituting  (4.3.3)  into  (4.3.1)  we  obtain 


y 

x 


'F11~F12X 


21 


12 

A22+XF] 


'  y  “ 

+ 

'  B1  ' 

X 

L  r  J 

.B2+XB1 . 

y  -  [ 


o 


(4.3.4) 


Note  that  this  transformation  has  preserved  the  information  structure,  i.e., 
the  GHR  structure.  In  this  new  basis,  J  is  spanned  by 


J  - 


(4.3.3) 
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Again,  applying  Proposition  4.3.2  shows  that  J  is  (C, A) -invariant.  In 
£act,  X  parameterizes  a  class  of  (C, A) -invariant  subspaces  S*,  an  element 
of  which  we  denote  by  It  is  also  clear  that  any  subspace  of  J  is  also 

(C ,  A) -invariant . 

Now  consider  a  subspace  with  a  basis  of  the  form 

(4.3.6) 


where  Y  is  a  (n-r)xr  matrix.  Now  we  have  that  n  7?[C]  Applying 

I  L 

Definition  4.3.1  to  (4.3.1)  we  have  in  matrix  form 


F11 

V 

'0‘ 

-A21 

A22- 

Y 

•  » 

-A22Y. 

(4.3.7) 


If  is  to  be  (C, A) -invariant,  we  must  have  that:  1)  A22^c^’  and 
2)  F^2^*0*  These  two  conditions  imply  the  is  an  unobservable  subspace 
of  (4.3.1).  If  satisfies  1)  but  not  2),  then  J  can  be  expanded  into 
a  (C, A) -invariant  subspace.  Define 


sp 


(4.3.8) 


Then  ^  is  a  (C ,A)-invariant  space.  This  analysis  enlarges  the  earlier 

defined  class  of  (C ,A)-invariant  subspaces,  S*.  For  each  X  in  (4.3.2) 

(4.3.9) 


where  Y  satisfies  (A2?+XF^7) , is  a  (C.A)-invariant  subspace.  Note  that 
Y  depends  on  X. 


It  can  be  shown  [22]  chat  Che  intersection  of  two  (C, A) -invariant 
subspaces  is  again  a  (C, A) -invariant  subspace.  Thus  any  subspace  of  Z  is 
contained  in  a  least  (C,A)-invariant  subspace.  As  an  example*  we 
will  identify  the  least  (C, A) -invariant  subspace  containing  8,  which  we 
denote  by  J. 

We  shall  see  that  J  can  be  identified  from  the  system  representation 
produced  by  MCA.  We  first  illustrate  the  idea  by  considering  generic  Case  1 
of  Section  4.2.  Then  the  system  is  represented  as  in  (4.2.14),  which  we  repeat 
here 


(4.3.10) 


y  * 


The  structure  of  the  input  matrix  in  (4.3.10)  shows  that  8  decomposes  as 


'i 

0 

8  =  sp 

r 

®  sp 

0 

.B22. 

(4.3.11) 


From  the  previous  discussion  it  is  immediately  recognized  that  is  already 
an  (C.A)-invariant  subspace.  Indeed,  the  transformation  in  (4.2.S)  (which 
is  similar  to  (4.3.3))  makes  this  transparent.  It  remains  to  find  a  sub¬ 
space  such  that  =  Now  must  be  (A77)-invariant  containing 

But  the  smallest  subspace  with  these  properties  is  (A^lB,,,,)  (-if). 

Thus  we  have  for  (4.3.10)  This  generic  case  illustrates  the 


following  general  result. 


Proposition  4.3.3  [44]:  J'n.f  *■$?*.  a 

Since  we  already  know  how  to  identify  <1*,  it  remains  to  identify  J 

* 

off  £  •  The  details  of.  the  general  case  are  quite  lengthy  and  can  be  found 
in  [42].  We  shall  briefly  describe  the  construction  for  the  following  system. 
Suppose  that  a  system  (C.1)-(C.2)  aggregates  after  two  cycles  of  MCA  and  has 
the  following  representation: 


-1 

y 

F11 

?12 

f13 

12 

y 

F11 

I 

0 

-3 

y 

- 

F11 

0 

0 

h 

F21 

F22 

F23 

x2 

r  J 

LA31 

A32 

A33 

(4.3.12) 


From  our  assumption  and  properties  of  MCA,  it  follows  that  and  are 
nonsingular  matrices.  We  introduce  the  addition  transformation 


(4.3.13) 


W 


-A 


32' 


-  4 


This  leaves  B  unaltered  and  replaces  A  by  a  zero  matrix.  Now  it  can  be  seen 
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that 


r  -l-i 
v 

••  m 

0 

•  « 

0 

T**  sp 

J 

.  o . 

©  sp 

_ 1 

a  sp 

h 

.0, 

e  <a33!B33> 


(4.3.14) 


The  first  term  in  (4.3.14)  is  a  subspace  of  fi  as  is  the  third  term  in 
(4.3.14).  This  third  term  is  in  7l[ C]  and  so  must  satisfy  the  two  conditions 
discussed  above.  This  generates  the  second  term  in  (4.3.14).  Of  course,  the 
fourth  term  is  <R*. 


4.4,  Examples 

4.4.1.  Pole-zero  cancellation 

As  a  short  and  interesting  example  which  ties  together  many  of  the 
preceding  ideas,  suppose  that  upon  applying  MCA  to  (C.1)-(C.2),  we  obtain 
a  representation  of  the  second  generic  kind,  i.e., 


(4.4.1) 


y  * 


From  Sections  4.3  and  4.4,  the  natural  basis  for  X  decomposes  into  two 
subspaces 


*  I  1 

0 

^  _  A 

-  *  £  =  sp 

r 

© 

0 

I 

n-r  J 

The  first  of  which  is  the  least  (C , A) -invariant  subspace  containing  3  and 
the  second  of  which  is  the  supremal  (A,B)-invariant  subspace  inu[C]. 
Furthermore,  \(A.)  are  the  invariant  zeros  of  (4.4.1). 

lx 


t 


Now  assume  that  Theorems  3.3.2  and  3.3.3  can  be  applied  to  (4.4.1). 


Then 


I A  I 

|P1*  *  6 


(4.4.3) 


6  -  sep(A1,A^) 


measures  the  near  unobservability  and 


I A  I 

,P2'  ~  6~ 


(4.4.4) 


measure  the  near  uncontrollability.  In  either  case,  some  of  the  eigenvalues 


of  A  are  given  by 


AUl  +  pJp^  Vi  (A^  +  A^MI  +  P^)'72  ] 
M(I  +  p2^)'Vl  (A4  +  P2A2)(I  +  P2P2),/J  ]  . 


(4.4.5) 


Thus,  if  the  system  is  either  nearly  unobservable  (II  P^B  is  small)  or  nearly 
uncontrollable  (II P2  B  is  small),  (4.4.5)  shows  that  some  of  the  eigenvalues 
of  A  approximate  the  invariant  zeros  of  (4.4.1).  In  fact,  it  would  seem  to 


be  the  product 


BA  I  -B A  1 
J  A3P1D  -  -  — — 


II A,  II  HA  I 

IIP -A-  II  <  — — - 
2  2  o 


(4.4.6) 


which  is  important  (cf.  (4.4.5)).  At  any  rate,  this  seems  to  be  a  state 


space  version  of  the  well-known  frequency  domain  phenomenon  of  approximate 


pole-zero  cancellation. 
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4.4.2.  Almost  invariance 

In  order  to  show  a  connection  between  the  material  in  this  chapter 
and  the  near  unobservability  concepts,  we  will  briefly  discuss  Willem's 
almost  invariant  subspaces  [45].  In  fact,  the  points  of  contact  seem  to  be 
so  strong,  that  a  complete  exposition  is  not  possible  at  this  time. 

In  the  interest  of  brevity,  we  simply  list  the  necessary  defini¬ 
tions  and  theorems  from  [45]  where  the  reader  is  referred  for  details. 

Let  d(x,K)  represent  the  distance  from  a  point  x  to  a  set  K  ,  both  in  Rn. 

Let  A  =  A+BF  where  F  is  a  matrix  dependant  on  s. 
s  s  s 

Definition  4.4.1  [45]:  A  subspace  Y_  C%  is  said  to  be  an  almost  invariant 

“■*“  di 

subspace  if  Vx  €Y  and  £  >  0,  there  exists  x(t)  such  that  x(0)  *  x  and 
- * -  o  a  o 


d(x(t)  e  Vt. 

A  subspace  C  %  is  said  to  be  an  almost  controllability  subspace  if 
a 

Vx  ,x.€  <R  ,  there  exists  T  >  0  such  that  Ve  >  0  there  exists  x(t)  with 
o  1  a 

the  properties  that  x(0)  =  x  ,  x(T)  =  x.  and  d(x(t)  )  £  e,  Vt.  D 

ox  a 

Theorem  4.4.2  [45,  Theorem  2]: 

1)  Y  =  Y  +  ^  for  some  4?  if  V  is  almost  invariant. 

a  a  a  a 

2)  Si  is  an  almost  controllability  subspace  if  and  only  if  there  exists 

a 

an  F  and  a  chain  {3^}  such  that  5?a  =  ^n-  ° 

Theorem  4.4.3.  [45,  Theorem  8]: 

Assume  that  Y  is  almost  invariant  and  that  there  exist  subspaces 

such  that  Y  -*Y  where  A„V  c  Y  . 

o  a  o  o  p 

1)  If  F  -+F,V  is  (A.B)-invariant  and  (A+BF)  Y  C  V  . 

o  a  a  a 

2)  If  Y  is  not  (A,B)-invariant,  then  F  -*■  ».  - 

We  shall  discuss  these  notions  for  a  system  which  is  of  generic 
case  2,  i.e.  the  system  is  represented  as 
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LXr 


4  J 


[I  0] 


LXr  J 


LXr 


V 


(4.4.7) 


Now  suppose  we  apply  feedback 


u  -  L1(5)y+L2(4)xr 


Then  the  closed  loop  system  is 


(4.4.8) 


y 

LXrJ  L 


Al+BlLl(o)  A2+B1L2(42) 


l_  r- 


(4.4.9) 


Now  in  (4.49)  is  an  almost  invariant  subspace  by  Theorem 

4.4.3  (1).  Simply  take  L^p)  »  0  and  define  L2  (p)  such  that 


L2(P)  •*  -A2  as  p  + 


(4.4.10) 


Of  course,  the  approximating  invariant  subspaces ^  are  a  subset  of  the 
invariant  subspaces  of  (4.4.9). 

A  more  interesting  question  is  whether  ft  [B]  an  almost  invariant 
subspace.  Yes,  by  Theorem  4.4.2  (2)  and  by  Theorem  4.4.3  (2),  F_  -*■  ®.  Now 

i, 

suppose  that  we  take  L2  (p)  *  0  (for  simplicity)  and  let  (  p  )  -►  ».  For 
fixed  rt  is  <R[B]  near  an  invariant  subspace?  Yes,  by  Theorem  3.3.3! 

By  using  Theorems  3. 2. 2-3. 2. 3  we  have 

1  Ajl 


I  pll  <  .  ■ 

Up) 


(4.4.11) 


where 


5(p)  =  sep(A1  +  B1L1(p),A4) 


(4.4.12) 


4 
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and  HP II  measures  Che  distance  between 


sp 


[:] 


(4.4.13) 


and  «t[B].  By  suitably  choosing  the  limit  of  L^(p),  6  (p)  -*■  ®  as  p  •. 
Hence,  the  closed  loop  invariant  subspaces  approximate  &[B]  but  they  require 
high  gain. 

We  also  note  that  in  these  coordinates 


d(x  (t),  «[B])  -  Ixr(t)  I 


(4.4.14) 

Then  using  the  analysis  of  Section  3.4.2.  from  equation  (3.4.14)  we  have 


where 


Ixr(t)»  i  I  P^2««y(0)leXlmt  +  »P22leY4mt|lxr(0)l 


+  IP22I  •  I  A3 » •  lly(O)  I  (e  X4mt-eXlmt) 


7(0) 

0 


(4.4.15) 


7(0)" 

3Z 

_Pil 

T  “I 
P21 

xr(0) 

PT 

L  12 

P22_ 

(4.4.16) 


since  Definition  4.4.1  requires  that  x(0)Q3.  In  (4.4.15)  A,  -*■<*>  and  P,  _-*•  ® 

lm  1Z 

by  (4.4,11).  Hence,  for  any  x^(o)£JJ  we  can  find  p  large  enough  such  that 
d(x^,S)  <  g.  Thus  S  sat  sfied  Def.  4.4.1  and  so  it  is  almost  invariant. 

The  purpose  of  this  rather  superficial  analysis  is,  first,  to  show 
that  the  trajectory  definitions  of  almost  invariant  subspaces  can  be  translated 
into  the  topological  characterization  introduced  in  Section  3.3.  Secondly, 
the  GHR  framework  allows  us  to  identify,  in  particular,  the  structure  of  the 
high  gain  feedback  matrix  which  produces  almost  invariance.  These  two 
observations  suggest  that  the  GHR  may  be  useful  for  the  analysis  and  design 
of  high  gain  systems. 


CHAPTER  5 


SYSTEM  DECOMPOSITION 


5.1.  Introduction  ...  . 

In  the  previous  chapters  we  have  discussed  the  geometric 
structure  of  the  linear  system  (C.1)-(C.2).  The  general  approach  has 
been  to  study  this  structure  in  a  specific  basis.  Relative  to  this 
basis  we  were  able  to  decompose  the  system  into  subsystems.  See 
Section  2.2.2.  Until  now  we  have  not  discussed  the  decomposition 
aspect  of  this  work.  This  chapter  serves  as  a  bridge  between  the 
geometric  theory  above  and  the  system  theory  below. 

The  decomposition  of  large  scale  system  is  of  interest  in  its 
own  right.  Many  results  for  large  scale  systems  are  stated  in  this 
framework  [46-48].  The  description  of  a  large  scale  system  as 
interconnected  subsystems  can  occur  in  several  ways.  On  the  one  hand, 
the  system  model  can  be  built  up  by  joining  together  the  subsystem  in 
some  specified  way.  In  this  case  the  interconnection  description  is 
straightforward.  At  the  other  extreme,  composite  systems  are  sometimes 
decomposed  into  an  abstract  description  for  analysis  [46-50]  or  compensator 
design  [51]  .  In  this  paper  we  shall  discuss  a  decomposition  procedure 
which  falls  somewhere  between  these  two  extremes.  It  is  not  assumed 
that  the  given  system  obviously  decomposes  into  some  interconnected 
structure.  Rather,  the  procedure  exploits  the  basic  underlying 
structural  properties  to  obtain  a  suitable  decomposition.  The  decomposi¬ 
tion  procedure  described  here  is  based  on  output  or  information  structure 
(observability),  input  structure  (controllability),  and/or  their 
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Interaction.  When  considering  only  input  structure,  related  results 
have  been  obtained  by  OzgUner  and  Perkins  [52] ,  Siljak  and  Vukcevic  [511, 
and  Sezer  and  Siljak  [53].  Of  these,  OzgUner  and  Perkins  is  closest 
to  the  spirit  of  this  paper.  The  other  two  decomposition  methods 
result  in  single  input  subsystems.  We  have  no  such  restrictions  here. 

The  decomposition  procedure  presented  here  should  not  be  confused 
with  decomposition  procedures  which  serve  to  rearrange  a  given  inter* 
connected  subsystem  description  <  [50]  or  [54]  ,  for  example).  The 
purpose  here  is  to  identify  subsystems  appropriate  for  the  intended 
use  of  the  model. 

The  method  described  here  differs  from  existing  methods  in 
emphasizing  input-output  interaction.  We  shall  present  this  method 
with  two  basic  goals  in  mind.  The  first  is  model  reduction.  We  will 
show  in  a  later  chapter  how  several  model  reduction  methods  are 
related  to  the  open  loop  geometry  and  the  system  decomposition  it 
induces.  Clearly,  input-output  interaction  is  useful  here.  The  second 
goal  is  compensator  design.  Since  this  interaction  structure  has  proven 
useful  in  feedback  design  [28]  we  are  able  to  identify  system 
descriptions  which  are  useful  in  closed  loop  design.  This  includes 
both  centralized  and  decentralized  control.  Chapter  7  is  devoted  to 
these  design  ideas. 

In  Section  5.2  we  will  briefly  summarize  the  previous  chapters 
to  establish  notation.  This  will  set  the  stage  for  the  model  reduction 
in  Chapter  6  and  the  centralized  compensator  design.  In  Section  5.3 
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we  extend  these  idees  to  systems  to  be  decomposed  Into  interconnected 
subsystems.  Section  5.4  applies  these  results  to  a  two  area  inter¬ 
connected  power  system. 


5.2.  Review  . . 

Consider  the  system  (C.1)-(C.2).  After  i  steps  of  chained 
aggregation,  (C.1)-(C.2)  is  represented  as 


• 

— 

* 

“  * 

y 

« 

ag 

CR 

y 

+ 

bg 

X 

r 

V. 

-  bgr 

v 

X 

L  rJ 

.  V 

(5.2.1a) 


y 

I 

I 


-  f 


(5.2.1b) 


(See  (2,2.8)  where  xf  ■  x*.)  Equation  (5.2.1)  suggests  that  we  think 
of  the  composite  system  as  two  interconnected  subsystems;  the 
aggregate  subsystem  given  by 


y  *  V  +  CRXr  +  bg  u 


and  the  residual  subsystem 


(5.2.2) 


x  »  A  x  +  B  v  +  Bn  u 
r  R  r  GR  R 


(5.2.3) 


(see  (2. 2. 9) -(2. 2. 10)) .  By  Proposition  3.2.4  we  have  that 


I 
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£ 


1 


-  sp 


(5.2.4) 


In  view  of  (5.2.4),  we  can  think  of  as  the  "state  space"  of  the 
residual  subsystem  (5.2.3).  Further,  CR  represents  the  information 
coupling  between  these  two  subsystems.  Considering  it  as  the  output 
matrix  of  the  residual,  we  say  the  system  (C.1)-(C.2)  has  been 
decomposed  into  interconnected  subsystems  (5.2.2)-(5.2.3)  based  on 
the  information  structure  of  the  system. 

By  dualizing  the  above  results  we  immediately  obtain  an 
input  structure  decomposition.  Indeed,  any  results  stated  below  for 
the  output  structure  have  a  dual  interpretation  in  terms  of  the  input 
structure.  However,  it  is  probably  more  useful,  because  of  the 
potential  closed  loop  applications,  to  investigate  how  the  input 
structure  overlaps  the  output  structure. 

Modified  Chained  Aggregation  (MCA),  described  in  Section  3.4.2, 
provides  a  system  representation  which  identifies  the  overlap  of  the 
input  and  output  structure.  MCA  will  produce  a  system  representation 
of  the  form 


(5.2.5)* 


y  -  [hl 


The  submatrices  in  (5.2.1)  and  (5.2.5)  are  not  the  same.  We 
have  relabeled  (5.2.5)  for  notational  simplicity. 
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(5.2.6) 


for  some  feedback  matrix  L  (L  is  identified  by  MCA).  Similar  to  (5.2. 1)» 
we  identify  and  aggregate  and  residual  subsystem  in  (5.2.5). 

Geometrically,  (5.2.5)  displays  a  decomposition  offi  ■  ft[B]  as 

B  -  ©nxj)  x©  (BnxJ)  (5.2.7) 


where 

BflfiJ  -  ft|“°  “j 

W 


(5.2.8) 


Note  that  BQ£^  is  unique  and  is  identified  explicitly  by  MCA.  This 
decomposition  splits  the  control  into  two  subvectors.  The  first 
subvector,  u,  influences  the  dynamics  of  the  aggregate  subsystem  and 
the  information  coupling  between  the  two  subsystems.  The  second  control 
vector,  3,  influences  the  dynamics  of  the  residual  and  the  affect  of  the 
aggregate  on  the  residual.  Since  J3  describes  how  the  controls  directly 
affect  the  state  of  the  system,  we  have  achieved  the  desired  input 
decomposition. 

Chapters  2-4  have  been  devoted  to  the  study  of  the  properties 
of  the  representations  in  (5.2.1)  and  (5.2.5).  Knowing  these  basic 
properties,  these  decompositions  should  be  useful  in  a  variety  of 
contexts.  We  shall  discuss  in  detail  their  relationship  to  two  major 


copies.  The  firsc  is  model  reduction.  Here  Che  aggregaCe  subsystem  is 
selecCed  Co  be  a  reduced  order  model  of  Che  original  sysCem  by  ignoring 


Che  coupling  Cerm  Of  course,  cercain  errors  are  made.  An 

aposceriori  error  analysis  Chen  shows  how  Che  nonuniqueness  of  Che 
cransformacions  can  be  used  Co  minimize  these  errors. 

The  second  topic  we  shall  discuss  with  respect  Co  these 
decompositions  is  compensator  design.  In  particular,  we  shall  introduce 
a  control  design  procedure  called  Three  Concrol  Component  Design,  which 
is  based  direcCly  on  (S.2.S).  We  shall  also  see  thac  (5.2.1)  is  useful 
for  observer  design  and  so  we  will  also  discuss  dynamic  compensators. 

5.3.  Interconnected  System  Decomposition 

In  this  section  we  investigate  the  role  of  the  input-output 
structure  in  decomposing  composite  systems  into  interconnected 
subsystems.  These  decompositions  result  when  the  submatrices  in  (5.2.1) 
and  (5.2.5)  take  on  a  special  form.  The  approach  here  is  to  use 
chained  aggregation  and  MCA  to  identify  the  inherent  information  and 
control  structure  of  the  composite  system.  Having  isolated  this 
structure  it  is  then  possible  to  recognize  how  these  structures 
decompose.  This  leads  to  various  decompositions  of  the  composite  system. 

This  approach  is  somewhat  ad  hoc  and  can  be  expected  to  be  most 
useful  when  there  is  some  underlying  physical  structure.  However,  it 
does  exhibit  great  flexibility.  The  number  of  decompositions  possible 
is  too  large  to  give  a  complete  listing.  Therefore,  we  shall  present 
a  number  of  examples  to  illustrate  the  approach. 
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The  first  three  examples  illustrate  decomposition  based  on 
information  structure  alone.  Then  we  turn  to  input-output  decompositions. 


Example  5.3.1.  Suppose  that  (5.2.1)  represents  (C.1)-(C.2)  after  one 

step  of  chained  aggregation  and  ■  I.  Furthermore,  suppose  that  y  In 

1  2 

(5.2.1b)  is  composed  of  two  subvectors  y  and  y  .  These  could  represent 
the  outputs  of  two  separate  "channels",  for  example.  With  this  finer 
structure  we  can  further  decompose  (5.2.1)  as 


agi 

AG12 

1 

1 

1 

1 

- 

CR1 

AG21 

AG2 

1 

1 

1 

1 

CR2 

bgri 

BGR2 

1 

1 

1 

1 

AR 

BG2  u 


(5.3.1) 


L  xd 

That  is  to  say,  the  assumption  of  two  separate  outputs  has  yielded  a 
further  partitioning  of  the  aggregate  subsystem.  Suppose  also  that 
A-g12  s  0  and  A_g21  =  Q.  This  is  not  merely  a  result  of  the  computational 
procedure  but  must  result  from  the  structure  of  the  system.  In  this 
case  (5.3.1)  takes  on  a  hierarchical  structure  of  two  subsystems  being 
driven  by  a  common  third  subsystem  (the  residual),  i.e. 


(5.3.3) 


Here  we  see  that  the  residual  subsystem  decouples  into  two  subsystems 
which  then  drive  the  aggregate,  i.e. 


See  Figure  2. 


C 


Residual 

Subsystem 


1st  Aggregate 
Subsystem 


2nd  Aggregate 
|  Subsystem  I 


Figure  1.  Example  5.3.1  -  Aggregate  decomposition. 


rm-r  9«« 

Figure  2.  Example  5.3.2  -  Residual  decomposition. 


Example  5.3.3.  Continuing  with  Example  5.3.2,  suppose  we  have  the 
structure  described  there  and,  in  addition,  the  matrix 

ECrL  C^]  (5.3.5) 

is  block  diagonal. 

This  structural  property  induces  a  further  decomposition  in 
the  aggregate  subsystem;  i.e.  (5.3.3)  becomes 


With  this  additional  structure,  (5.3.6)  can  be  interpreted  as  two  subsystems 
interconnected  through  their  outputs,  i.e. 
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BBRi 


—  — 

— 

CRi 

i 

y 

+ 

BGi 

U  + 

AGi,3-j 

ARi  _ 

i 

X 

_  r_ 

_  - 

_BGRi,3-j  _ 

i-1,2  .  (5.3.7) 


See  Figure  3. 

Equations  (5.3.6)  and  (5.3.7)  very  nicely  illustrate  the  close 
relationship  between  the  information  structure  in  (5.3.6)  (as  derived 
from  (5.2.4)  and  the  physical  structure  in  (5.3.7).  By  changing  our 
point  of  view  we  obtain  different  decompositions. 

Also  note  the  difference  in  the  decomposition  procedure  between 
Examples  5.3.1  and  5.3.3.  In  Example  5.3.1  a  partitioning  of  the  output 
was  assumed  and  then  this  was  used  to  induce  a  decomposition  in  state 
space.  In  Example  5.3.1  exactly  the  reverse  occurred.  Here,  the 
decomposition  in  state  space  lead  to  a  partitioning  of  the  output  vector.  C 

Thus  far  we  have  considered  some  decompositions  based  only  on 
the  information  structure.  Clearly,  by  dualizing  the  results  we  obtain 
decompositions  based  on  the  input  structure.  However,  as  noted  above, 
it  is  of  more  interest,  because  of  the  potential  for  closed  loop 
applications,  to  obtain  decompositions  based  on  both  the  input  and 
output  structure.  We  will  consider  several  such  decompositions  next. 


Example  5.3.4.  In  this  exampLe  we  consider  a  refinement  of  the  structure 

of  ExampLe  5.3.3.  In  addition  to  assuming  A  and  C  are  block  diagonal 

R  R 

in  (5.3.6)  suppose  that  BGR,  BQ  and  ER  are  also  block  diagonal. 
Furthermore,  Let 


Figure  3.  Example  5.3.3  -  Ouput  interconnection 
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AGi,3-i  “  BGiNil 

(5.3.8) 


BGRi, 3-i  “  BRiNi2 


Then  (5.2.5)  cakes  on  Che  form  of  two  systems  connecCed  in  a  feedback 
configuration,  i.e. 


(5.3.9a) 


i 


v 


+  u 


J 


(5.3.9b) 


See  Figure  4.  (Compare  with  (5.3.7).) 

Computationally,  the  existence  of  this  representation  depends 
on  the  block  diagonaliza tion  of  Br  and  B  (along  with  previous  assumptions) 

v  K 

and  the  existence  of  and  in  (5.3.8).  Since  we  have  allowed  input 
transformations,  there  is  some  flexibility  to  meet  these  conditions.  C 


Example  5.3.5.  In  this  example  we  will  generalize  the  structure  of 
Example  5.3.4  to  include  representations  where  the  interconnections 
(5.3.9b)  are  dynamic.  To  illustrate  the  basic  idea,  we  consider  the 
composite  case  first.  Suppose  we  have  obtained  the  representation  in 
(5.2.5)  and  =  0.  If  we  assume  that  B^,  is  square  and  nonsingular  we 


can  write 


Figure  4.  Example  5.3.4  -  Input-output  interconnection. 
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Here  Che  aggregate  decomposes  into  two  subsystems. 


yl  -  w1  +  +  Br.tvi  *  1-1  • 2 


(5.3.13) 


where  th i  interconnections  v  are  now  dynamic.  They  are  given  by 


‘r  “  ARi*r  Lijy 


vi  *  Nixr  +  ^i.S-i7' 


i-1,2 


(5.3.14) 


See  Figure  5. 


One  case  of  particular  interest  occurs  when  and  L22  are 
both  zero  in  (5.3.12).  This  uncouples  the  interconnection  equations 
in  (5.3.14)  and  results  in  a  simplification  of  the  interconnection 
structure.  See  Figure  4  where  the  are  now  thought  to  represent 
dynamic  connections.  This  represents  a  particular  case  of  the 
interconnected  structure  used  in  [50].  □ 

We  have  given  here  by  no  means  an  exhaustive  List  of  possible 
structures.  The  similarity  between  the  structures  in  Examples  5.3.2 
and  5.3.5  indicates  the  variety  of  interpretations  possible.  This 
flexibility  should  allow  the  representation  to  match  its  intended  use. 

The  structural  decompositions  discussed  above  occur  frequently 
in  the  large  scale  system  literature.  For  example,  certain  conditions 
have  been  given  for  dynamic  decentralized  stabLization.  These  conditions 
have  been  specialized  for  the  structures  discussed  in  Example  5.3.4  [46] 

and  Example  5.3.5  [49],  A  particular  control  strategy  has  been  proposed 


F  P-7347 


Example  5.3.5  -  Dynamic  interconnection. 


for  systems  exhibiting  the  structure  of  Example  5.3.2  [28]  .  Recently, 


a  decomposition  in  the  spirit  above  was  used  in  a  dynamic  game  context  [55] 
The  dual  decompositions  of  Example  5.3.3  have  been  used  in  stability 
analysis  [48] . 

5.4,  Two  Area  Power  System 

In  this  section  we  apply  the  ideas  of  the  last  section  to 
decompose  a  two  area  power  system.  This  example  illustrates  the  close 
connection  between  the  physical  structure  and  the  information  structure 
and  how  this  interrelationship  can  be  used  to  obtain  different 
decompositions. 

We  will  consider  a  two  area  power  system  in  which  each  area 
contains  two  thermal  power  plants.  The  outputs  are  the  frequency 
deviation  in  each  area  and  the  tie  line  power  flow.  A  description  of 
the  system  model  and  parameters  is  given  in  the  Appendix.  Physically 
we  would  expect  the  model  to  decompose  into  two  interconnected  sub¬ 
systems  but  the  decomposition  is  not  immediate  because  the  tie  line  does 
not  fit  conveniently  into  either  area.  As  suggested  in  Section  5.3  we 
apply  one  step  of  chained  aggregation  to  obtain 


This  is  exactly  the  structure  of  Example  5.3.2.  Note  that  (5.4.1)  does 
not  further  reduce  to  the  aggregate  structure  of  Example  5.3.3.  This 
is  the  reason  we  were  not  able  to  identify  two  subsystems  in  the  original 
model.  .We  can,  however,  obtain  a  hierarchical  structure  for  (5.4.1)  by 
permuting  the  states  y^  and  : 


h2l  h23 


h12  *99  0 


h32  0 


0  0 


C1  ° 


0  c„ 


0  o 


C91° 


0  cr 


(5.4.4) 


This  shows  that  we  can  associate  the  states  y^  and  y^  with  their  residual 
subsystems,  respectively.  This  is  perhaps  a  more  physically  meaningful 
decomposing  in  that  each  residual  subsystem  now  represents  exactly  each 
area  while  the  aggregate  subsystem  is  exactly  the  tie  line.  However,  it 
may  not  be  as  useful  for  control  design  since  the  control  does  not  appear 
in  the  aggregate. 

Returning  again  to  the  decomposition  in  (5.4.1)-(5.4.3),  write 


AL  -  diag  {aJl,AL22},  BL  »  diag 


11’  22- 


(5.4.5) 
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Noting  that 


D 


1 

j 


B1,  N* 

jj  j 


j-1,2 


(5.4.6) 


we  see  that  each  subsystem  decomposes  into  two  subsystems  (representing 
each  plant)  as 


4i  J  ,  i  „i  ai  i  i 

jj  rj  +  BjjV  BJjUj 


j“l»2  oj^**!*  < *2*3* 


(5.4.7) 


This  is  similar,  but  dual,  to  the  structure  in  Example  5.3.2. 

As  mentioned  above  these  decompositions  may  not  be  useful 
since  the  control  variables  do  not  appear  in  the  equations  for  the 
output  variables  (which  we  want  to  control).  To  obtain  a  different 
decomposition,  we  continue  to  apply  chained  aggregation  to 
(5.4. l)-(5.4.3)  until  the  control  appears  in  the  aggregate.  Since 
the  residuals  are  decoupled  we  can  apply  chained  aggregation  to  each 
subsystem  separately.  In  this  case  two  stages  of  chained  aggregation 
are  needed.  The  result  is 
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From  (5.4.8),  we  see  each  subsystem  has  one  output  and  two 
inputs.  To  isolate  the  residual  control,  we  apply  an  input  trans¬ 
formation  as  in  (5.2.9): 


so  that  the  input  matrix  in  (5.4.8)  becomes 


We  now  decompose  the  subsystems  as  indicated  by  dashed  lines  in  (5.4.9) 
and  group  it  with  (5.4.1)  to  obtain  the  decomposition: 
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(5.4.11b) 


i=1.2 


In  (5.4.11)  we  have  represented  the  original  model  as  three  interconnected 
subsystems  as  described  in  Example  5.3.2.  As  opposed  to  the  earlier 
decomposition  (5.4. 1) - (5.4. 3) ,  at  least  one  component  of  the  control  enters 
each  subsystem.  This  feature  makes  this  decomposition  attractive  for 
closed  loop  design.  This  decomposition  also  illustrates  how  the  ideas 
in  Section  5.3  extend  directly  when  more  than  one  step  of  chained 
aggregation  is  used. 


CHAPTER  6 


MODEL  REDUCTION 


6.1.  Introduction 

Chained  aggregation  and  the  GHR  was  originally  Introduced  as  a 
model  reduction  technique  [2].  Indeed,  it  is  an  extension  of  Aoki's  concept 
of  aggregation  [8].  Since  Aoki's  original  work,  much  insight  has  been  gained 
into  aggregation  [1,56-62].  In  this  chapter,  we  will  show  how  many  of  these 
ideas  can  be  unified  and  extended  using  the  GHR  framework. 

The  purpose  of  this  chapter  is  not  an  indepth  review  and  discussion 
of  model  reduction  techniques,  but  to  point  out  where  the  GHR  and  its  geometry 
come  into  play  in  reduced  order  modeling.  The  most  important  aspect  of  this 
chapter  is  not  the  details  of  any  specific  reduction  procedure,  but  a  general 
understanding  of  how  the  geometric  structure  of  (A,B,C)  relates  to  system 
theoretic  ideas.  If  the  final  goal  is  compensator  design,  one  can  question 
the  wisdom  of  open  loop  model  reduction.  Therefore,  the  true  benefits  of 
this  chapter  will  be  realized  when  these  ideas  are  combined  with  compensator 
structures  discussed  in  the  next  chapter. 

In  Section  6.2  we  will  discuss  Aoki's  aggregation  concepts  [8], 
the  birthplace  of  the  GHR.  This  will  include  its  abstract  form  as  a  projec¬ 
tion  method.  Section  6.3  applies  these  ideas  to  modal  methods  for  reduction. 
Two  different  error  analysis  methods  are  presented.  Section  6.4  discusses 
model  reduction  by  using  properties  of  the  controllability  and/or  observ¬ 
ability  gramian.  Section  6.5  relates  the  GHR  to  the  cost  decomposition  work 
of  Skelton  [ 11] . 

Throughout  this  chapter  we  shall  assume  that  the  system  (C.1)-(C.2) 
is  asymptotically  stable  and  observable.  These  assumptions  are  not  always 


required,  particularly . in  Sections  6.2  and  6.3,  but  this  allows  for  a  unified 
treatment.  The  appropriate  generalizations  can  be  easily  made  by  the  reader. 


6.2.  Aggregation  _ 

6.2.1  Projection  _ _ _  _  . 

We  first  consider  Aokifs  algebraic  concept  of  perfect  aggregation. 
Thus,  we  Interpret  the  output  equation  (C.2)  as  defining  the  variables  to  be 
approximated,  i.e.,  C  Is  the  "aggregation  matrix"  [8}.  According  to  Aokl  [8  ] 
the  system  aggregates  if  there  exists  a  matrix  F  such  that 


CA  -  FC. 


In  this  case,  the  reduced  order  model  is 

z  »  Fz  +  Gu 
G  -  CB. 


(6.2.1) 


(6.2.2) 


To  see  how  this  relates  to  the  GHR,  consider  (C.1)-(C.2)  after 
one  step  of  chained  aggregation,  i.e.. 
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(6.2.3) 


In  the  basis  (6.2.3)  it  is  easy  to  compute  (6.2.1).  We  have 

7 

CA  *  CltFll  F12]  "  F[C1  °]  =  FC‘ 


(6.2.4) 


Assuming  Is  nonsingular  (C  has  full  row  rank) ,  F  exists  exactly  when 


F^2*0,  in  which  case 


Also  note  that 


F  ■  C  .  -w 

rii’i 


CB  -  G  -  C^. 


(6.2.6) 


Thus.  contains  all  the  Information  of  CB.  From  (6.2.5)-(6.2.6)  the 
reduced  order  model  (6.2.2)  Is  constructed. 

The  geometric  Interpretation  of  the  GHR  gives  us  Immediately  a 
geometric  Interpretation  of  aggregation.  From  Theorem  3.2.4 


X1  '  Xn  ‘  sp 


(6.2.7) 


is  the  unobservable  subspace  of  (6.2.3).  Hence,  systems  that  aggregate  in 
Aoki's  sense  exhibit  this  very  strong  form  of  unobservablllty . 

Aggregation  has  also  been  interpreted  as  a  projection  method (.57-59] 
This  also  has  an  easy  interpretation  in  the  basis  of  (6.2.3).  Assume  that 
C^*!  (or  change  basis  in  the  state  space).  Then  the  projection  matrix  P  is 


[■  •]. 


(6.2.8) 


A  projection  matrix  is  characterized  by  1)  the  subspace  being  projected 
along  and  2)  the  subspace  being  projected  on.  Here  the  subspace  being 
projected  along  is 


71  [C]  «  sp 


(6.2.9) 


This  subspace  is  unique  and  determined  by  C.  The  subspace  being  projected 
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on  is 


(6.2.10) 


Note  that  F12-°  implies  the  rows  of  C  (the  aggregation  matrix) 

T 

span  a  left  eigenspace  of  A.  Since  fl[C  ]1??[C],  the  reduced  order  model  is 
determined  by??[C]  [62].  This  fact  is  often  used  in  constructing  an  aggre¬ 
gation  matrix  [57-60]. 


6.2.2.  Coherency 


As  an  application  of  these  ideas  we  mention  some  recent  work  in 


coherency  of  power  systems.  While  coherency  has  been  a  standard  topic  in  power 
systems,  we  shall  follow  the  recent  new  approach  of  Kokotovic,  et  al  [63,64] 


where  the  reader  is  referred  for  details.  The  system  model  is  given  by 


x  »  Ax,  x(0),  t >  0. 


(6.2.11) 


If  (6.2.11)  represents  a  disturbed  system,  then  the  disturbances  are 

modeled  in  the  initial  condition.  States  x^  and  x^  of  system  (6.2.11)  are 

coherent  with  respect  to  n-r  modes  of  A.  a  .  if  and  onlv  if  none  of  these 
—  a 

modes  is  observable  from 

y^(t)  -  (t)  -  x±(t) .  (6.2.12) 

Now  suppose  we  have  chosen  n-r  modes  o^  and  that  there  exists  r 
distinct  groups  of  coherent  states,  i.e.,  groups  of  states  coherent  to  each 
other.  In  each  group  pick  a  reference  state  and  let  these  be  the  last  r 
states  in  (6.2.11).  Now  define  an  (n-r)xn  output  matrix  C  for  (6.2.11)  by 

C  -  [I  -L  ].  (6.2.13) 

n-r  g 

The  matrix  L  contains  the  grouping  information.  The  (i.j)th  element  of  L 
S  S 


90 


is  1  if  state  zi  is  in  the  j-th  group,  i«l,...,n-r  and  j«n-r+l, . . . ,n. 
Note  that  each  row  of  (6.2.13)  defines  an  output  as  in  (6.2.12).  Next 
define  a  similarity  transformation 


(6.2.14) 


in  which  case  (6.2.11)  becomes 


(6.2.15) 


where  R[L  ]  is  a  Riccati  equation  in  L  .  Using  this  setup,  it  is  shown 
8  8 

that  the  outputs  defined  by  (6.2.13)  are  coherent  with  respect  to  the  modes 

a  if  and  only  if  R[L  ]  «  0.  In  this  case  a  m  X(A  ). 
a  g  a  a 

This  whole  framework  fits  directly  into  the  discussion  above.  We 
note  that  the  transformation  in  (6.2.14)  transforms  (6.2.13)  into 


[Ir0j. 


(6.2.16) 


Hence,  this  is  the  first  step  of  chained  aggregation.  Then  (6.2.15)  aggre¬ 
gates  if  and  only  if  R(L  ]  *0  and  the  unobservable  modes  are  X(A  ).  In 

g  a 

other  words,  jC^  for  (6.2.15)  is  the  unobservable  subspace.  This  shows  that 
coherency  is  Intimately  related  to  the  information  structure  of  the  system. 

Coherency  is  a  topic  closely  related  to  physical  systems  which  do 
not  satisfy  the  strict  algebraic  requirements  above.  Therefore,  a  common 
approach  is  to  define  near-coherency .  This  can  be  formalized  in  the  frame¬ 
work  above  by  requiring  in  a  nearly  coherent  system  that  the  contribution 

of  modes  a  to  the  output  (6.2.12)  be  small  [64].  This  idea  can  be  made 
a 

precise  by  introducing  near  unobservability  of  Section  3.3.  If  a  system  is 


coherent  when  Is  unobservable,  then  it  is  nearly  coherent  if  is  nearly 

unobservable.  In  fact,  the  contribution  of  the  inodes  a  to  the  output 

a 

trajectory  has  been  bounded  in  (3. 4. 15).  There  we  see  that  coherency  is 

strongest  when  there  is  a  separation  between  A.  (A  )  and  A  (A.)  in  (6.2.15),  a 

a  a 

fact  which  has  already  been  noted  [  64] .  We  also  note  that  the  Riccati  approach 
to  coherency  is  very  similar  to  the  approach  to  near  unobservability  in 
Section  3.3. 

An  approach  to  near  coherency  very  similar  to  the  one  here  has  been 
proposed  in  [12]  under  a  slightly  different  formulation.  Those  results  can 
be  recovered  by  using  the  subspace  measure  proposed  in  Section  3.3. 


6.3.  Modal  Methods 

6.3.1.  Preliminaries 

There  is  a  number  of  model  reduction  methods  [7,9,65],  based 
essentially  on  the  open  loop  modes,  that  fit  nicely  into  the  aggregation 
framework.  Suppose  that  the  system  (C.1)-(C.2)  is  represented  as 


(6.3.1a) 


(6.3.1b) 


We  wish  to  construct  a  reduced  order  model  for  the  output  variables  y  from 
some  set  of  r  modes  of  (6.3.1).  This  model  will  be  identified  by  selecting, 
sequentially,  two  state  space  transformations.  The  first  transformation  is 


the  orthogonal  transformation,  to  preserve  as  ouch  structural  Information 


K- 

r.  ■■ 
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as  possible.  Introduced  in  equation  (3.3.7), 
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Here  P  is  chosen  such  that  the  transformed  system  is  given  by 
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The  second  stage  is  the  selection  of  a  second  transformation 
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X  I 
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Then  (6.3.3)  is  transformed  into 
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F21  -  F21  +  XF11-  F22X’  G2-G2  +  XGr 


(6.3.2) 


(6.3.3) 


(6.3.4)  | 


(6.3.5a) 


(6.3.5b) 

(6.3.5c) 


From  this  representation  we  obtain  the  reduced  order  model 


9  ■  FU7  *  GjU 

yr  '  [Pll-P12XIii- 


(6.3.6) 


The  error  is  calculated  from 

e(t) 


T  y<0  7 

■  y(t)-y  (t)  -  [0  P -A 

r  12  [x.(t) 


(6.3.7) 


P12*r(t)* 


From  (6.3.5)  xt(t)  is  given  explicitly  from 


i  (t)  -  e  22i  (0)  +  / 


t  F„„(t-T) 


22 


G,(X)u(T)dx  +  /  e 
0 


t  F._(t-r) 


22 


F21(X)y(t)dt 

(6.3.8a) 


F  .t  t  F  (t-s) 

y(t)  -  e  1  y(0)  +  f  e  11  G  u(s)ds. 

0  1 


(6.3.8b) 


So  these  methods  proceed  as  follows.  First  (6.3.2)  is  selected. 
Comparing  (6.3.6)  to  (6.3.3),  we  see  that  this  is  the  same  as  selecting  the 
eigenvalues  of  the  reduced  order  model  to  be  a  subset  of  the  open  loop  eigen¬ 
values  of  (6.3.1).  In  forming  the  reduced  order  model  (6.3.6),  an  error  is 
made  (6.3.7).  By  specifying  a  cost  function  on  e(t),  we  can  select  a  second 
transformation  (6.3.4)  to  reduce  this  error.  On  the  other  hand,  X  may  be 
selected  to  satisfy  some  other  criteria  (F21(X)*0  or  (X)  *  0)  and  then  an 
a  posteriori  error  analysis  performed. 

This  method  is  clearly  a  projection  method.  The  subspace  to  be 
projected  along  is  determined  by  the  transformation  (6.3.2)  (the  span  of  the 


last  (n-r)  columns) .  The  subspace  to  be  projected  on  is  determined  by  the 
second  transformation  (6.3.4)  (the  span  of  the  first  r  columns). 


6.3.2.  Dominant  modes 

This  method  of  model  reduced  proceeds  by  first  reducing  A  In 
(6.3.1)  to  a  diagonal  form.  In  our  framework  this  means  selecting  X  such  that 
P  (X)-0.  Then  to  produce  the  correct  steady  state  error,  the  neglected 
state  variables  are  approximated  by 


*r  ’  0  '  F22*r  +  °2U 


*r  ‘  -r2252u(t)- 


(6.3.9) 


In  this  case  the  error  equation  (6.3.7)  is  modified  as 


e(t)  *  P12(xr(t)  +  F"p22u(t)) 


(6.3.10) 


If  we  assume  as  zero  initial  state  and  a  constant  input  of  magnitude  uq,  then 
we  can  estimate  the  error  using  the  analysis  in  [66].  Note  that  F^2  =*  0 
eliminates  x^  dependence  on  y  and  simplifies  the  analysis.  Let  T2  be  the 
(n-r) x (n-r)  matrix  of  eigenvectors  for  F22  and  A2  the  corresponding  modal 


matrix. 


t  F  (t-r)  , 

I e(t)l  <  IP ,,11/  e  “  G2(X)uQdt  +  F^G^uJ 


22  2  o' 


t  A-(t-t)  , 
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<  |P12lie(T2)|G2|.|uo|.|A2Ae  II 
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(6.3.11) 


In  this  bound  we  identify  three  terms.  The  first  is 


k(T2) 

minjA^Aj)  j  * 


(6.3.12) 


This  term  bounds  the  dynamic  response  of  the  neglected  residual  subsystem. 

Note  that  this  term  tends  to  zero  as  the  dynamics  become  arbitrarily  fast. 

This  indicates  that  the  fast  modes  should  be  neglected. 

The  second  term  is  I  P^l  •  This  measures  the  contribution  of  the 

residual  subsystem  to  the  output.  Clearly,  we  want  this  to  be  small.  Taken 

in  combination  with  the  first  term,  we  see  that  we  would  like  the  system 

(6.3.1)  to  be  nearly  unobservable  (see  Section  3.3). 

The  third  term  is  IG„|  *lu  I.  This  measures  the  excitation  of  the 

2  o 

residual  system  by  the  input,  again  a  term  we  would  like  to  be  small.  It  is 
not  hard  to  see  that  in  (6.3.6)  if  *  0  and  is  small,  then  the  dominant 
invariant  subspace  approximates  B  .  This  gives  a  guideline  for  selecting  the 
dominant  Invariant  subspace  to  be  retained. 

Bounds  similar  to  (6.3.11)  were  given  in  a  series  of  notes  [66-68]. 
However,  they  ignored  the  geometric  structure  of  the  problem  in  favor  of 
emphasizing  the  dependence  on  time  scale. 


6.3.3.  Mltra's  method 

Mitra's  method  [9]  is  a  systematic  way  of  choosing  X  in  the  second 

transformation  (6.3.4)  to  minimize  the  integral  squared  error  (6.3.7).  The 

first  step  is  exactly  the  same  as  in  dominant  mode  selection.  The  first 

transformation  (6.3.2)  is  chosen  to  yield  (6.3.3)  though  the  eigenvalues  to 

be  retained  are  not  yet  specified.  Now  introduce  the  cost  function 

00 

min  E(X)  -  /  |e(t)i2dt.  (6.3.13) 


So  the  second  step  of  the  method  is  to  choose  the  second  transformation 
(6.3.4)  to  minimize  (6.3.13).  Bear  in  mind  that  (6.3.13)  is  parameterized 
on  the  first  transformation  (6.3.2). 


Next  note  that 
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where 


trace  P12(XQ1XT + Q2XT+ XQ2 + Q3]P^2 


(6.3.15) 
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(6.3.16) 


is  the  well-known  controllability  gramian  computed  for  the  system  (6.3.3). 
However,  this  gramian  has  the  same  properties  as  the  controllability  gramian 
for  (6.3.1)  because  (6.3.3)  was  obtained  from  (6.3.1)  by  an  orthogonal 


transformation. 


Since  (6.3.15)  is  always  greater  than  zero,  we  can  minimize  the 
integral  of  the  error  by  minimizing  the  quantity  in  brackets  in  the  last 
term.  Thus 


■jjj  l  trace  XQ^1  +  Q^X1  +  XQ2  +  Q3] 


-  2(3^  +  2Q2 


(6.3.17) 


T  -1 
X  *  ^1  Q2* 


(6.3.18) 


The  optimal  value  of  the  cost  function  (6.3.15)  is 


J  le(t)H2dt  -  trace  tQ3-Q2QllQ2lP12‘ 


(6.3.19) 


Thus  for  each  P,  i.e.,  each  invariant  subspace,  (6.3.19)  gives  the  associated 


cost.  The  straightforward  application  is  then  to  look  at  all  r-dimensional 
invariant  subspaces  and  choose  the  one  with  least  cost  [9,60]. 

The  above  analysis  provides  more  insight,  however.  First,  Q  is 
calculated  from 


F  F 
21  22 


Q3-l  Lq* 


t  n  T  T  - 
<*2  F11  F21 

Q  J  L  0  F77. 


1  [GF  GF].  (6.3.20) 


This  decomposes  as 


FnQi +  QiFn 


T 

-G1G1 


(6.3.21a) 


F11Q2  +  Q2F22 


'(GlG2  +  QlF21> 


(6.3.21b) 


I 


—  trace  XQ  *  Q. 


To  compute  X  In  (6.3.4)  first  solve  (6.3.21a)  followed  by  (6.3.21b).  Thus, 
the  full  order  Lyapunov  equation  need  not  be  solved.  The  savings  in  compu¬ 
tation  may  be  great  if  the  order  of  the  reduction  is  large. 


Secondly,  substituting  X  in  (6.3.18)  into  (6.3.15)-  we  obtain  the 
controllability  gramian  of  the  optimal  representation  to  be 


-Ql  0  - 

.°  vQXV ' 


(6.3.22) 


This  is  reminiscent  of  Moore's  balancing  technique  [10].  Indeed,  the  control¬ 
lability  gramian  for  the  reduced  order  model  is  the  same  as  that  subsystem 
embedded  in  the  original  representation  (see  (6.3.21a)).  For  further 
discussion  see  Section  6.4. 

Finally,  consider  (6.3.8)  (cf.  (6.3.7))  which  essentially  describes 
the  error.  The  error  is  generated  by  two  inputs:  1)  u(x)  directly  into  the 
residual  subsystem  through  G^,  and  2)  u(t)  filtered  through  the  aggregate 
subsystem,  i.e.,  y(x),  through  E^.  Both  and  F^  are  functions  of  X. 

So  we  interpret  the  calculation  of  X  as  trading  off  the  effect  of  the  input 
and  the  effect  of  the  aggregate  on  the  error. 

The  error  analysis  of  Mitra's  method  yields  results  similar  to  the 
dominant  mode  method.  Again  we  would  like  the  system  to  be  nearly  unobserv¬ 
able,  i.e.,  IIP^O  smaH  anc*  the  eigenvalues  of  F^-,  fast.  We  would  also  like 
to  have  F^9  and  G^  small  together.  This  has  the  effect  of  diagonalizing  Q  in 
(6.3.16)  (cf.,  the  analysis  of  weak  observability).  Then  the  separation  of 
the  eigenvalues  will  ensure  a  small  cost  (6.3.19). 


We  can  combine  the  observations  to  identify  characteristics  of 
subspaces  used  to  produce  a  reduced  order  model.  For  clarity,  suppose  the 
invariant  structure  decomposes  as 


Vvd 


(6.3.23) 


where  is  the  invariant  subspace  of  retained  modes  while  V,  is  its 
r  a 

complementary  invariant  subspace  of  discarded  modes.  Then 

(1)  Y,  should  be  near  7?[C].  This  implies  IF. .1  is  small. 

a  12 

(2)  should  be  nearly  orthogonal  to  Vr*  This  implies  If^i*  is  sma11- 

(3)  Yr  should  be  near  This  implies  IG^I  is  small. 

(4)  Y  should  contain  the  slow  modes  and  flr.  the  fast  modes.  This 

r  a 

implies  the  effect  of  the  aggregate  and  input  on  the  residual  will 
be  small. 

We  shall  see  that  (4)  is  structurally  related  to  (2);  i.e.,  systems  with 
separated  eigenvalues  tend  to  have  property  (2).  Also  (2)  and  (3)  combined 
with  (4)  implies  that  the  controllability  gramian  tends  to  be  diagonal  with 
separated  singular  values.  Similarly,  (1)  and  (2)  combined  with  (4)  leads 
to  weak  observability.  The  combination  of  these  two  concepts  will  be 
discussed  in  the  next  section. 


6.4.  Balancing  Techniques 


6.4.1.  Internal  analysis 


The  discussion  of  Mitra's  method  in  the  last  section  provides  a 


bridge  between  modal  model  reduction  and  model  reduction  based  on  the 
controllability  and  observability  gramian  [10-11],  here  called  balancing 


techniques.  The  idea  here  is  to  choose  state  coordinates  such  that  the 
controllability  and/or  observability  gramian  exhibit  special  strucuuture, 
l.e.,  they  are  both  diagonal  with  the  same  eigenvalues  (called  second  order 
modes  [10]).  Then  if  there  is  a  separation  in  the  spectra,  the  weakly 
observable/controllable  subsystem  is  discarded.  As  an  example,  note  that  if 
B=0,  then  this  reduction  is  just  weak  observability. 

Thus,  we  see  that  Mitra's  method  is  a  combination  of  modal  methods 
and  balancing  techniques.  The  modal  analysis  enters  in  the  first  stage  as 
already  discussed.  The  balancing  is  performed  in  the  second  stage  by 
selecting  a  basis  such  that  the  controllability  gramian  is  block  diagonal 
(see  (6.3.22)). 

The  balancing  technique  is  related  to  the  framework  presented  here 
as  follows.  Given  the  system  (6.3.1),  choose  P^  in  (6.3.2)  such  that  K, 
the  observability  gramian,  is  block  diagonal,  i.e., 

-  K.  (6.4.1) 

Since  is  orthogonal,  the  eigenstructure  of  K  is  preserved.  Now  introduce 
a  second  transformation 
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where  is  chosen  such  that 

ViKiVi  =  1  i =  1,2. 
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This  makes  the  observability  gramlan  an  Identity  matrix.  Now  select  a 
second  matrix  of  the  form  (6.3.2)  such  that  the  controllability  gramlan  Q 
is  block  diagonal,  l.e.,  select  such  that 


P 


2 


(6.4.4) 


Since  the  observability  gramlan  K  is  the  identity  it  is  unaffected  by  P^. 

The  second  order  modes  are  now 

°si  "  (o(Qi))Vl  1  *  1*3.  (6.4.5) 


A  transformation  similar  to  V  in  (6.4.2)  will  create  identical  control¬ 
lability  and  observability  gramians.  The  idea,  clearly,  is  to  choose  P^  such 
that  the  spectra  of  and  Q2  are  separated,  if  possible.  In  this  case,  the 
subsystem  associated  with  the  larger  spectra  is  retained. 

In  Section  3.3  we  discussed  the  relationship  between  near  unobserv¬ 
ability  and  the  observability  gramlan.  A  dual  theory  immediately  follows  for 
controllability.  How  then  are  these  concepts  related  to  balancing 
techniques?  Suppose  that  we  compute  the  transformations  above,  but  delete 
V  in  (6.4.2).  Then  following  the  transformation  i*2  *n  (6-4.4),  the  observ¬ 
ability  gramian  (6.4.1)  becomes 
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The  intuition  is  this.  Suppose  that  the  original  system  is  weakly  observ¬ 


able  and  controllable,  i.e.. 


°i^l>  >>  for  *U 

ai^l^  >>  °i  * 


(6.4.6) 


Then  if  IP^I  is  small,  i.e.,  the  weakly  controllable  and  observable  subspaces 
are  near  each  other,  then  the  system  is  nearly  balanced.  This  is  equivalent 
to  asking  that  K  be  block  diagonal.  To  see  this,  we  note  from  (6.4.4)  that 
the  weakly  controllable  subspace  is 
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WC 


(6.4.7) 


Then  we  wish  to  estimate  how  close 2?wc  is  to  an  K- invariant  subspace.  Apply 
Theorem  3.2.3  to  (6.4.5).  Using  singular  value  perturbation  theorems  [69], 
we  have 

°i(Kl  +  P2K2P2)  "  Ci(Kl)  +£<P2K2P2) 

(6.4.8) 

°i(K2  +  P2KlP2)  -  °i(K2)  +  5(P2K1P2) 

£(P2K2P2}  “  2£(P2)2.(K2) 

5(P2K1P2)  -  25(P2)5(K1). 

«  A  A 

Thus,  if  o(P  )  -  |P2I  is  small  and  (6.4.6)  holds,  sep(K^,K^)  will  be  nonzero. 
For  a  bound  on  we  have 

1K2!  <  21  cos  01 -I P2I  (IK1( K2B ) .  (6.4.9) 

This  again  shows  that  a  small  leads  to  a  nearly  block  diagonal  K. 

Combining  (6.4.8)  and  (6.4.9)  we  obtain  the  estimates  in  Theorems  3. 2. 2-3. 2. 3 


It  is  interesting  to  note  that  a  separation  in  o(K^)  and  oCK^) 
decreases  the  magnitude  of  allowable  angles  between  the  weakly  controllable 
and  observable  subspaces.  (This  separation  increases  the  bound  in  (6.4.9).) 

This  is  to  be  expected  since  the  discarded  subspace  must  be  both  weakly  con¬ 
trollable  and  observable. 

The  relationship  between  near  unobservability  and  weak  observability 
was  established  in  Section  3.4.  Roughly,  the  system  is  weakly  observable  if 
the??[C]  is  near  the  fast  subspace.  A  dual  analysis  of  controllability  would 
yield  that  the  system  is  weakly  controllable  if  the  <R [ B ]  is  near  the  slow 
subspace.  Both  weak  controllability  and  observability  require  that  the  slow 
and  fast  subspaces  be  nearly  orthogonal.  These  properties  characterize  a 
balanced  system  which  will  reduce.  However,  note  that  if,  say,  o^C K)  are  closely 
grouped  and  a^(Q)  are  separated,  then  the  system  will  decompose  according  to 
the  controllability  criterion.  A  dual  situation  also  occurs. 

6.4.2.  External  analysis 

It  should  be  noted  that  this  model  reduction  technique  is  based  on 
internal  properties  rather  than  external  properties  [ 10]  such  as  impulse 
response 

H(t)  *  CeAtB.  (6.4.10) 


This  is  in  contrast  to  Mitra's  method  which  is  based  on  the  systems  impulse 
response.  To  see  this,  suppose  that  (A,B,C)  in  (6.4.10)  are  given  by  (6.3.5) 
where  X  is  selected  as  in  (6.3.18).  Then  compute 


105 
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Doing  the  computations  in  (6.4.11)  we  have 

J  "  C1Q1C1  +  C2Q3C2*  (6.4.13) 

Next  observe  that  since  Q  is  block  diagonal, 

AQ  +  QAT  -  -BBT  (6.4.14) 

decomposes  as 

F1Q1  +  Q1F1  "  "G1G1  (6.4.15a) 

F2Q2  +  Q1F3  “  "G1G2  (6.4.15b) 

F4Q3  +  Q3F4  "  “V1**  (6.4.15c) 

Hence,  we  can  again  measure  the  error  between  the  impulse  responses  of  the 

full  and  reduced  order  model  by  imposing  a  measure  on  (6.4.13). 

Ue  can  make  a  few  qualitative  judgments  based  on  (6.4.12d)  and 
(6.4.13).  Both  weak  controllability  and  observability  enter  in  two  ways. 
First,  they  enter  geometrically  through  IP^I  “*s^-n  »  i*l,2.  Here  0^ 

measures  the  angles  between  and  the  weakly  observable  space  and  6^  measures 
the  angels  between  the  weakly  observable  space  and  the  weakly  controllable 
space.  Secondly,  they  enter  through  the  eigenvalues  associated  with  each 
space.  That  is,  the  eigenvalues  of  K  enter  through  and  (see  (6.4.3)) 
and  the  eigenvalues  of  Q  enter  through  Qy 

There  are  other  ways  to  measure  the  error  when  balancing  is  used 
to  obtain  a  reduced  order  model  [11].  See  Section  6.5.2. 


6.5.1.  Formulation 


All  reduced  order  models  deviate  from  the  true  system  In  some  way. 
There  are  many  ways  of  evaluating  this  error  as  is  evidenced  from  this 
chapter.  Evaluation  of  a  reduced  order  model,  then  depends  on  the  evaluation 
of  the  error.  In  an  effort  to  put  these  methods  in  a  common  framework, 
Skelton  [  11]  has  suggested  evaluating  a  candidate  reduced  order  model  by 
identifying  its  contribution  to  a  cost  function.  Presented  in  the  framework 
of  Chapter  5,  the  residual  subsystem  which  contributes  little  to  the  cost 
is  truncated. 

Suppose  that  u(t)  in  (6.3.1a)  is  a  zero  mean  Gaussian  process  with 

covariance 

E{u(T)uT(t)>  -  S6(t-r).  (6.5.1) 

Define  a  cost  function  for  (6.3.1a)  as 

V  -  lim  E(xT(t)Qx(t)} 

-  lim  E{xT(t)CTCx(t)} 

t-*- 

-  lim  E{ly(t)l2}  (6.5.2) 

t-M» 

where  C  is  a  square  root  of  the  nxn  symmetric  positive  semidefinite  matrix 
Q.  (This  illustrates  the  origin  of  C  even  in  other  model  reduction 
contexts.)  It  is  easily  shown  [70]  that 

V  -  tr  KBSB1  (6.5.3) 

where  K  is  the  observability  gramian  of  (6.3.1)  (see  (3.4.4c)). 
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Consider  (6.3.1)  where  (6.3.1b)  is  replaced  by 


(6.5.4) 


With  respect  to  this  partitioning,  we  decompose  the  cost  as 


T  '  T1  +  V2 


v  «  V  +  V 
1  11  12 


(6.5.5) 


tr  K^BjSB^  1-1,2. 


If  >>  V^,  this  suggests  that  we  take  (A^.B^.C^)  as  a  reduced  order  model  as 
discussed  In  Section  5.2.  However,  In  general  the  cost  associated  with  the 
reduced  order  model  will  not  be  because  of  the  coupling  In  (6.3.1)  and 
(6.5.5).  Therefore,  define  the  error  cost  as 


<5 V  -  lim  E{ly(t)-y  (t)l  > . 

t-H»  r 


(6.5.6) 


From  (6.5.5)  and  (6.5.6)  various  Indexes  can  be  derived  [11]. 

The  general  approach  Is  to  select  special  representations  to 
derive  various  expressions  for  the  cost  decomposition  quantities  above.  This 
can  be  easily  done  in  our  framework  and  we  will  discuss  the  various  methods 
above  here.  To  evaluate  <5V  in  (6.5.6)  is  somewhat  more  difficult. 

Expressions  can  be  derived  by  augmenting  the  model  state  equations  (6.3.1a) 
with  the  reduced  order  model  equations  and  defining  an  appropriate  output 
equation.  Then  <5V  is  computed  as  in  (6.5.3).  The  details  are  found  in 
(11].  We  shall  discuss  6V  only  where  our  approach  leads  to  insight  and 
simplifications  (i.e.,  modal  methods). 


6.5.2.  Cost  decoupling 

To  gain  insight  into  the  contribution  of  various  subsystems  to 

the  cost,  it  might  be  useful  to  identify  coordinates  in  which  the  costs 

and  are  uncoupled,  i .  e .,  -  0,  ij*j .  It  is  easy  to  see  from  (6.5.3) 

that  two  sufficient  conditions  for  this  are  1)  K  is  block  diagonal,  or 
T 

2)  BSB  is  block  diagonal.  Either  of  these  conditions  can  be  accomplished 

by  an  orthogonal  transformation  as  in  (6.3.2).  Motivations  for  choosing 

these  transformations  can  be  obtained  from  Chapter  3  and  Section  6.4 

Intuitively,  this  must  be  connected  to  weak  observability.  In 

fact,  the  connection  is  explicit  if  we  reformulate  the  problem  by  assuming 

that  u(t)  -  0  and  the  initial  condition  has  a  zero  mean  Gaussian  distribution 

T 

with  covariance  BSB  .  If  the  cost  function  is  taken  to  be 


V  -  /  ly(t)ldt,  (6.5.7) 

0 

then  its  value  is  given  in  (6.5.3).  Hence,  each  cost  component  measures 

the  observability  of  that  subsystem  when  the  initial  states  are  distri- 
T 

buted  as  BSB  (rather  than  I  as  assumed  in  Chapter  3) .  If  K  is  block 
diagonalized,  truncating  the  residual  system  corresponds  to  eliminating 
the  weakly  unobservable  states. 

It  should  be  emphasized  that  the  error  cost  6V  is  not  V^.  This 
cost  must  be  determined  from  the  full  error  model  [ ll] .  This  corresponds  to 
internal-external  discussion  of  Section  6.4. 

To  evaluate  the  error  cost  6V  requires  that  the  reduced  order  model 
be  stable.  By  block  diagonalizing  K,  this  reduced  order  model  is  almost 
always  stable  [ 10] •  An  even  stronger  result  is  proved  in  [  71] . 


Block  diagonalizing  BSB  haa  been  called  disturbance  decoupling. 

X 

It  is  possible  to  simultaneously  block  diagonalize  both  K  and  BSB  .  The 

required  transformations  are  the  transformations  needed  for  balancing 

T 

(Section  6.4,  (6.4.1)-(6.4.4)  where  BSB  replaces  Q) .  In  this  form,  it  is 
easy  to  see  that  the  truncated  states  should  be  weakly  observable  states 
which  are  not  disturbed  too  much. 

6.5.3.  The  GHR 

A  second  obvious  choice  of  representation  is  the  GHR  (6.3.1),  so 
that  the  reduced  order  model  is  the  aggregate  subsystem.  A  real  issue  here 
is  the  stability  of  the  reduced  order  model  since  it  is  not  guaranteed. 
However,  suppose  the  system  is  nearly  unobservable.  Then  by  Theorem  3.3.3, 
A(A^)  approximates  a  subset  of  A  (A).  In  this  case  the  reduced  order  model 
should  be  stable.  Bounds  can  be  derived  using  the  results  in  Sections  3.3 
and  3.4. 

The  relationship  of  these  coordinates  to  cost  decoupled  coordinates 
follows  from  the  discussion  of  near  unobservability  and  weak  observability. 
Here  again  it  is  easy  to  see  that  the  weakly  observable  states  should  not 
be  heavily  distributed  to  produce  a  good  reduced  order  model. 

6.5.4.  Modal  methods 

Lastly,  consider  the  modal  methods  of  Section  6.3.  Assume  that 
the  system  is  represented  as  in  (6.3.5a)  following  transformations  (6.3.2) 
and  (6.3.4).  In  these  coordinates  it  is  not  easy  to  give  a  simple 
expression  for  the  component  costs — hence,  it  is  not  obvious  a  priori  how  to 


select  P  in  (6.3.2)  (in  contrast  to  the  above  two  methods). 


These  coordinates  simplify  the  calculation  of  6V.  Indeed,  the 
error  model  is  given  by  (6.3.5a)  with  output  equation  (6.3.7).  With  respect  to 
this  representation,  the  observability  gramian  of  (6.3.5a)-(6.3.7)  is 


where 


.-fi  b] 

LK2  *A 

*3F22  +  F22F3  "  "P12P12 

*2F2  +  FUF2  -  -f2lh 

Vn  +  *11*1  -  -52?21  -  '4*2 


(6.5.8) 


(6.5.9a) 

(6.5.9b) 

(6.5.9c) 


which  is  the  result  in  [  11] .  Now  the  error  cost  is  calculated  from 


(6.5.10) 


In  developing  the  modal  representation  (6.3.5),  the  submatrix  X 
in  the  transformation  (6.3.4)  was  not  specified.  We  see  now  that  it  can  be 
used  in  two  ways  here  which  lead  to  simplifications.  First,  X  can  be  chosen 
such  that  there  is  disturbance  decoupling.  For  instance,  if  is  nonsingular, 
we  can  choose  X  such  that  Gj^O.  However,  the  motivation  for  this  seems  to 
be  only  computational  simplicity. 

The  second  choice  of  X  would  be  such  that  F^*®-  Then  the 
coordinates  in  (6.3.5)  are  closer  to  the  well-known  modal  coordinates.  In 
this  case  the  solutions  to  both  (6.5.9b)  and  (6.5.9c)  are  K  ■^“O.  Thus, 


the  cost  reduces  .to 


CHAPTER  7 


THREE  CONTROL  COMPONENT  DESIGN 

7.1.  Introduction 

Having  introduced  the  subspaces  £ ^  in  Chapter  3,  we  studied  their 
behavior  under  state  feedback  in  Chapter  4.  The  intent  of  that  analysis  was 
to  lay  the  foundation  for  the  use  of  these  subspaces  in  closed  loop  design. 

In  this  chapter  we  introduce  a  control  design  scheme,  called  Three  Control 
Component  Design  (TCCD) . [2,28,72] ;  based  on  the  closed  loop  subspaces  £ 

We  then  see  how  this  procedure  relates  to  several  control  design  problems. 

The  TCCD  is  derived  from  the  system  representation  produced  by  MCA. 
Hence,  we  immediately  have  a  geometrical  interpretation  of  the  TCCD.  This 
allows  us  to  connect  it  with  several  known  [ 3  ]  results  and  extend  these 
ideas.  However,  a  deeper  insight  is  also  gained  by  considering  MCA  in  a 
state  feedback  context;  the  TCCD.  First,  the  TCCD  establishes  an  a  priori 
control  structure  which  reflects  a  hierarchy  of  design  goals  by  giving 
priority  in  the  design  procedure  to  the  control  component  used  to  meet  the 
primary  design  objectives.  This  is  done  by  integrating  the  control  objectives 
into  the  GHR.  Secondly,  the  TCCD  explicitly  identifies  a  reduced  order  model 
which  is  used  to  meet  the  primary  design  objectives.  Thus  reduced  order 
modeling,  implying  a  reduction  in  control  computation,  is  integrated  directly 
into  the  design  process.  Finally,  the  combination  of  these  two  aspects  into 
an  overall  design  procedure  allows  for  the  evaluation  of  the  trade-off 
between  the  order  of  the  reduced  order  model  and  the  complexity  of  the 


control  computation. 


The  decomposition  of  the  control  design  in  the  TCCD  is  directly 
related  to  the  system  decomposition  discussed  in  Chapter  S.  Hence,  this 
makes  it  useful  for  decentralized  control  of  interconnected  subsystems. 

While  the  TCCD  applies  to  any  of  the  system  structures  in  Section  5.3,  it 
is  discussed  here  for  systems  connected  through  their  outputs.  However, 
the  approach  here  can  easily  be  used  for  any  of  the  other  structures  in 
Section  5.3. 

When  the  TCCD  is  used  in  this  special  context,  it  takes  on 
additional  properties.  The  information  and  control  structure  is  exploited 
to  produce  a  control  scheme  which  is  hierarchical  and  partially  decentralized. 
The  control  problem  is  decomposed  into  a  global  problem,  which  is  a  coordi¬ 
nator  problem  for  the  interaction  variables,  and  local  control  problems 
which  lead  to  a  decentralized  design.  Through  this  analysis  we  are  able  to 
identify  what  models  are  necessary  for  computing  the  control,  and  what 
information  exchange  is  necessary  for  implementing  the  control. 

This  approach  to  control  of  an  interconnected  system  is  different 
from  most  other  schemes  [51,73-76].  First,  the  scheme  is  not  necessarily 
decentralized.  If  the  interactions  between  subsystems  are  strong  then  it 
may  be  justified  to  relax  the  decentralization  constraint  (if  possible). 

The  control  scheme  proposed  here  specifically  identifies  the  control 
component  associated  with  the  interactions  and  allows  for  a  (partially) 
centralized  design.  Secondly,  the  interaction  variables  are  given  priority 
in  the  design  while  the  local  control  is  computed  only  after  the  inter¬ 
action  variables  are  compensated.  This  is  exactly  opposite  of  many  decen¬ 
tralized  schemes  which  assign  the  local  controls  first  and  then  compensate 
for  the  interactions  [51,73,76].  Finally,  the  control  scheme  is  based  on  the 


information  and  control  structure  and  not  on  the  physical  subsystem 
structure  directly  as  are  most  other  schemes  [51,73-76]. 

If  the  purpose  of  the  state  feedback  matrix  is  to  place  the  closed 
loop  poles,  then  the  TCCD  decomposes  this  one  large  problem  into  two  smaller 
problems.  It  is  then  left  to  the  designer  to  choose  his/her  favorite  design 
technique.  However,  here  we  also  discuss  the  use  of  the  TCCD  in  an  optimal 
control  framework  [2].  This  analysis  when  combined  with  interconnected 
systems  has  applications  in  dynamic  games  [2].  We  discuss  a  Pareto  game 
here . 

The  discussion  of  the  TCCD  in  decentralized  control  is  a  straight¬ 
forward  application  of  these  ideas.  As  a  novel  use  of  this  approach,  we 
discuss  a  noninteraction  problem.  By  combining  the  decomposition  of 
Chapter  5  directly  with  the  TCCD,  we  are  able  to  solve  the  problem  of 
decoupling  a  system  with  static  output  feedback.  While  solutions  to  this 
problem  are  known  [3,77-82],  the  procedures  here  bring  fresh  insight  and  new 
interpretations  to  the  issues  involved  here. 

Finally,  the  TCCD  is  applied  to  a  class  of  nonlinear  systems  [83]. 
The  main  idea  is  to  extend  the  concept  of  invariant  structure  of  linear 
system  theory  [44]  to  a  class  of  nonlinear  systems  characterized  by 
arbitrary  dynamics  and  a  collection  of  static  nonlinearities,  and  to 
determine  a  partial  invariant  structure  of  such  nonlinear  systems  and  its 
basic  characteristics.  A  number  of  significant  gains  follow  from  such  a 
development.  First  is  an  explicit  use  of  nonlinear  system  structure,  loca¬ 
tion  of  nonlinearities,  and  structure  of  inputs  into  the  nonlinearities  in 
the  classification  of  nonlinear  control  systems.  Second  is  the  extension 


of  structural  Invariants  defined  for  linear  systems  to  a  wide  class  of 
nonlinear  systems,  and  the  use  of  these  invariants  in  the  classification  of 
nonlinear  system  types.  Third  is  a  natural  decomposition  of  the  control 
structure  in  nonlinear  systems  into  three  control  components:  the  first  is 
a  compensatory  component  that  brings  out  the  invariant  structure  and  reduces 
the  nonlinear  system  to  a  tandem  configuration  of  an  inherently  nonlinear 
subsystem  forcing  a  residual  linear  subsystem  through  static  nonlinear  inter¬ 
connections;  the  second  is  a  (possibly  nonlinear)  control  component  that 
solves  the  synthesis  problem  associated  with  the  inherently  nonlinear  part 
of  the  system;  the  third  is  the  residual  control  component  that  shapes  the 
dynamics  of  the  residual  linear  system.  It  is  stressed  that  the  major  gain 
is  in  the  decomposition  of  the  nonlinear  synthesis  problem  since  nonlinear 
synthesis  need  be  considered  only  in  providing  controls  adequate  for  the 
inherently  nonlinear  part  of  the  system. 

This  chapter  is  organized  as  follows.  Section  7.2  discusses  the 
basic  properties  of  the  TCCD.  These  follow  directly  from  the  results  in 
Chapter  4.  The  TCCD  is  then  extended  to  interconnected  systems  in  Section 
7.3.  In  Section  7.4  the  TCCD  is  discussed  in  an  optimal  control  framework. 
Section  7.5  discusses  output  decoupling.  Section  7.6  extends  these  ideas 
to  nonlinear  systems. 

7.2.  3asic  Properties 
7.2.1.  Structure 


In  this  section,  we  outline  the  basic  principles  behind  the  TCCD. 
In  fact,  the  basic  properties  follow  immediately  from  MCA.  We  assume,  first 


Chat  Che  system  (C.1)-(C.2)  has  been  transformed  by  MCA  to  yield 


With  respect  to  this  basis  consider  the  feedback  law 


(7.2.1a) 

(7.2.1b) 
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which  we  have  separated  into  three  components. 


The  first 


component  is 


(7.2.3) 


where  is  chosen  such  that 

CR  +  BjKj  -  0.  (7.2.4) 

MCA  guarantees  the  existence  (and  uniqueness)  of  K^.  With  Kp  chosen  as  in 
(7.2.4),  the  closed  loop  aggregate  subsystem  will  not  depend  on  the 
residual  states  x^,.  Thus,  we  call  this  first  component  the  decoupling 
control . 

By  specifying  the  decoupling  control  as  in  (7.2.4),  the  aggregate 
subsystem  becomes  a  reduced  order  model  for  the  output  variables  y.  If 
the  primary  design  objectives  are  given  for  these  variables,  then  these 


objective*  can  be  met  by  specif lying  the  second  component  of  the  control 


(7.2.5) 


called  the  aggregate  control. 

The  decoupling  control  has  reduced  the  closed  loop  system  to  a 
tandem  configuration  of  the  aggregate  subsystem  driving  the  residual  sub¬ 
system.  Once  the  aggregate  control  (7.2.5)  has  been  selected,  we  can  use 
the  third  control  component 
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the  residual  control,  to  stabilize  and  control  the  residual. 

Thus  the  TCCD  is  a  decomposition  of  the  control  law  based  on  the 
information  and  input  structure.  The  design  is  hierarchical  in  that  the 
aggregate  control  can  be  selected  freely  to  meet  the  primary  design  objec¬ 
tives  but  the  residual  control  depends  on  the  aggregate  control  and  can  only 
be  computed  after  the  aggregate  control. 

The  geometric  interpretation  of  MCA  gives  us  an  immediate  geo¬ 
metric  interpretation  of  the  TCCD.  The  selection  of  in  (7.2.4)  provides 

K  * 

the  essential  construction  of  a  state  feedback  matrix  K  such  that  £^  *  £  . 
Another  way  of  saying  this  is  that  K  makes  £*  closed  loop  invariant.  It  is 
clear  from  (7.2.1)  that 


is  a  closed  loop  unobservable  subspace.  Since  L  is  Che  supreoal  such  sub¬ 
space  (Proposicion  4.2.4),  Che  decoupling  concrol  in  (7.2.4)  is  making  Che 
syscem  maximally  unobservable  [4].  Therefore  we  will  call  Chis  choice  of 
decoupling  concrol  Che  maximal  decoupling  concrol.  We  will  discuss  ocher 
choices  of  decoupling  conCrols  below.  However,  nexc  we  turn  Co  propercies 
of  Che  aggregace  and  residual  subsyscems  generaCed  by  maximal  decoupling. 

If  Che  maximal  decoupling  concrol  is  used  to  aggregace  Che  syscem, 
che  residual  scace  space  is  1/* .  Since  V*  is  unique  [  3  ,  Theorem  4.1],  Che 
corresponding  reduced  order  model  is,  in  chis  sense,  unique.  As  che  design 
of  che  aggregace  and  residual  concrol  componencs  involves  scandard  design 
procedures  for  che  aggregace  and  residual  subsystems  respectively,  we  will 
examine  the  stabilizability  of  these  subsystems  and  see  how  they  relate  to 
the  original  system  properties.  This  analysis  applies  to  certain  steps  of 
the  synthesis  procedure  described  above.  The  final  goal  is  to  produce  a 
state  feedback  law.  Hence,  certain  aspects  of  reduced  order  models  are  not 
relevant  here,  such  as  stability  of  the  reduced  order  model.  We  will  be 
interested  in  those  structural  properties  that  relate  to  the  overall  synthesis 
procedure. 


7.2.2.  Aggregate  subsystem 

Let  <A|B>  denote  the  reachable  space  of  (7.2.1)  and  <A  |B  >  the 

G  G 

reachable  space  of  the  aggregate  subsystem  generated  by  the  maximal 
decoupling  control.  Furthermore,  let  Z±C Z  denote  the  state  space  of  the 
aggregate  subsystem. 

Proposition  7.2.1:  [28]  <A|B>  fi  r  »  <A  !b„>. 

1  G  G 

Proof:  Consider  the  following  system  which  aggregates  following  two  cycles 
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y(y) 


(7.2.8) 


From  our  assumption  and  properties  of  MCA,  it  follows  that  and  are 
nonsingular  matrices.  Hence,  the  maximal  decoupling  control  is  computed  to 
cancel  the  subblocks  and  F^.  Since  the  reachable  space  is  the  same  for 
the  systems  (7.2.1)  and  (7.2.8),  denote  the  system  matrices  in  (7.2.8)  by 
(A,B,C).  Similarly,  denote  the  aggregate  subsystem  (generated  by  the 
maximal  decoupling  control)  by  (F,G,H).  Let 
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With  this  notation,  we  have  that 

<Aj  B>  -  <AiBTU1>  +  <A|&<2>.  (7.2.10) 

From  the  special  representation  in  (7.2.8),  it  follows  that 

^l.n  <Al  B33^2>2  "  ?13B33V  F23B33Lf2’  (7.2.11) 


Zin<A|B331<2>2CB^lC<AGlBG> 


(7.2.12) 


We  conclude  Chat 


*in<AlB33VC<AGlV 


(7.2.13) 


and  so  the  result  follows  from  (7.2.10). 

The  special  system  in  (7.2.8)  contains  all  of  the  relevant  structure 
for  Proposition  7.2.1.  An  easy  Induction  argument  extends  the  result  to 
all  systems  (7.2.1).  □ 

Thus  the  aggregate  subsystem  inherits  its  reachability  properties 
from  the  original  system  and  these  properties  are  not  altered  by  this  choice 
of  decoupling  control.  We  now  given  an  interpretation  of  the  aggregate's 
unreachable  modes. 

Definition  7.2.2  [84]'*  The  input  decoupling  zeros  are  the  roots  of  the 


invariant  polynomials  of  [sI-A,Bj.  The  output  decoupling  zeros  are  the 
roots  of  the  invariant  polynomials  of 


m  • 


The  system  zeros  are  the  invariant  zeros  together  with  those  decoupling 
zeros  not  already  included  in  the  Invariant  zeros. 

Proposition  7.2.3:  Let  the  system  (7.2.1)  be  aggregated  by  the  maximal 
decoupling  control.  Then  the  eigenvalues  of  the  unreachable  modes  of  the 
aggregate  are  input  decoupling  zeros.  Hence,  they  are  system  zeros  which 
are  not  invariant  zeros. 


Proof :  The  first  statement  follows  easily  from  Definition  7.2.2  and 
Proposition  7.2.1.  Since  all  the  Invariant  zeros  are  in  the  residual, 
(Proposition  4.2.7,  see  discussion  below),  the  second  statement  follows. 
Porter  [85]  has  given  a  similar  description  of  system  zeros  which  are  not 
invariant  zeros. 

7.2.3.  The  residual  subsystem 

In  fact,  the  residual  subsystem's  properties  are  described  in 
Section  4.2.  From  Proposition  4.2.5,  the  controllable  subspace  of  the 
residual  is  exactly  A*,  the  supremal  controllability  subspace  in  7?[C]. 
Furthermore,  we  know  that  some  of  the  residual's  poles  are  fixed  by  the 
maximal  decoupling  control  and  that  these  poles  are  exactly  the  invariant 
zeros  (Proposition  4.2.7).  Recall  that  we  interpreted  the  maximal  decoupling 
control  as  making  the  system  maximally  unobservable.  This  has  been  inter¬ 
preted  as  canceling  some  of  the  system  poles  by  the  invariant  zeros  [86-89] 
and  we  recover  those  results  here. 

The  TCCD  allows  us  to  integrate  all  of  the  structural  results  of 
this  section  directly  into  a  control  synthesis.  This  allows  us  to  evaluate 
the  procedure  as  we  work  through  it  and  modify  it  to  take  into  account 
important  structural  properties  of  the  system.  In  particular,  we  note 
that  an  unstable  invariant  zero  will  result  in  an  unstable  design.  Next 
we  will  discuss  ways  to  structurally  modify  the  aggregate  and  residual  sub¬ 
system  to  circumvent  this  and  other  problems. 

7.2.4.  Alternative  decoupling  strategies 

In  the  most  general  terms,  the  TCCD  induces  a  system  decomposition. 
Maximal  decoupling  represents  an  extreme  case  of  this  decomposition.  Other 


decoupling  controls  will  produce  other  decompositions.  The  different 
decompositions  possible  reflect  the  designer's  freedom.  One  particular  use 
of  this  freedom  is  discussed  in  detail  below. 


As  pointed  out  above,  the  maximal  decoupling  control  will  produce 
an  unstable  design  if  one  of  the  invariant  zeros  is  unstable.  This  situa¬ 
tion  may  be  corrected  by  choosing  a  decoupling  control  other  than  the 
maximal  decoupling  control.  This  corresponds  to  selecting  a  feedback  matrix 
L  such  that  The  geometrical  aspects  of  this  problem  were  discussed 

in  Section  4.2.  We  shall  apply  that  analysis  to  the  design  problem  at 
hand,  i.e.,  how  to  choose  a  decoupling  control  to  produce  a  stable  residual. 

Since  we  are  interested  in  the  spectrum  of  the  residual  in  the 
factor  space  mod  fl*,  we  shall  assume  Then  (7.2.1)  becomes 


(7.2.14) 


It  is  easy  now  to  compute  the  decoupling  control.  First,  identify  the 
invariant  subspace  associated  with  the  stable  invariant  zeros  by  finding 
a  nonsingular  matrix  T  such  that 

(7.2.15) 

Here,  the  eigenvalues  of  Ag  are  the  stable  invariant  zeros.  Such  a  trans¬ 
formation  always  exists  since  the  spectra  of  Ay  and  Ag  are  disjoint.  We 
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u 
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interpret  this  as  a  change  of  basis  in  state  space  as 


We  can  now  immediately  compute  the  decoupling  control  as 
u  "  -BG1cR252  +  v- 


(7.2.16) 


(7.2.17) 


(7.2.18) 


We  note  that  the  aggregate  subsystem  in  (7.2.17)  again  inherits 
the  reachability  properties  from  the  original  system.  The  proof  is  similar 
to  the  proof  of  Proposition  7.2.1  and  is  omitted.  Furthermore,  by  using 
the  results  of  Section  4.2  we  can  see  directly  that  the  invariant  zeros  of 
the  aggregate  subsystem  are  exactly  the  unstable  invariant  zeros  of  the 
original  system.  Finally,  the  residual  dynamics  are  completely  fixed,  but 
stable  by  construction.  Hence,  the  TCCD  with  decoupling  control  (7.2.13) 
will  produce  a  stable  design.  The  extension  of  these  ideas  to  systems  in 
which  ft* t  0  is  straightforward  and  discussed  in  Section  4.2.5. 

The  discussion  above  is  a  particular  case  of  a  general  procedure 
for  constructing  alternative  decoupling  strategies.  The  analysis  in 


Chapters  2-6  provides  the  general  background  to  use  this  freedom.  In 
particular,  the  results  in  Chapter  5  may  be  used  to  decompose  the  system 
along  physical  lines  that  are  still  compatible  with  the  TCCD.  Or  since  the 
aggregate  subsystem  is  a  reduced  order  model  of  the  output  variables,  the 
discussion  in  Chapter  6  provides  insight  for  a  good  selection  of  a  decoupling 
control . 


.3.  Control  of  Interconnected  Systems 


7.3.1  Systems  connected  through  their  outputs 


The  TCCD  is  based  on  the  system  decomposition  produced  by  MCA. 

Thus,  when  interconnected  systems  are  decomposed  by  11CA,  the  TCCD  should  have 
a  direct  application.  Indeed,  the  TCCD  can  be  applied  to  any  of  the 
decompositions  discussed  in  Section  5.3.  We  shall  discuss  its  application 
to  one  particular  structure  described  in  Example  5.3.3.  However,  the 
approach  here  will  generalize  to  any  system  structure  produced  by  MCA  or 
chained  aggregation. 

For  ease  of  presentation,  we  recall  the  system  structure  from 
Example  5.3.3  as 


(7.3.1) 
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By  rearranging  the  states,  we  see  that  (7.3.1)  consists  of  two  inter¬ 
connected  subsystems 


AGi,3-i 

BGRi,3-i 


_3-i 

y 


(7.3.2) 


Note  that  (7.3.1)  highlights  the  information  and  control  structure  in  the 
composite  system  while  (7.3.2)  emphasizes  the  physical  subsystem  structure. 
Equation  (7.3.2)  also  emphasizes  that  these  subsytems  are  interconnected 
through  their  outputs.  So  we  interpret  the  state  variables  y*  as  system 
wide  interconnection  variables.  A  model  of  these  variables  is  given  by  the 
aggregate  subsystem  in  (7.3.1).  The  remaining  states  x*  are  interpreted 
as  local  state  variables.  The  local  nature  of  these  states  is  reflected  in 
the  block  diagonal  structure  of  the  residual  dynamics. 

Implicit  in  the  TCCD  is  a  ranking  in  the  design  goals.  It  is 
assumed  that  greater  priority  is  given  to  the  output  variables,  i.e.,  the 
aggregate  subsystem.  If  in  (7.3.1)  we  are  primarily  concerned  with  the 
system  wide  interaction  variables,  then  the  TCCD  has  a  natural  application. 
Indeed,  it  has  other  interesting  features. 

First  consider  the  decoupling  control.  The  block  diagonal 
structure  in  (7.3.1)  yields 
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i=i,: 


(7.1.3) 


This  control  component  is  decentralized  in  that  it  can  be  implemented 
locally.  Once  this  is  done,  the  interaction  variables  are  modeled  by  the 
aggregate  subsystem.  The  aggregate  control  is  given  by 


(7.3.4) 


Further  structure  may  be  imposed  on  (7.3.4)  as  fits  the  problem.  The  aggre¬ 
gate  control  reflects  the  essence  of  the  interconnected  nature  of  the  problem. 
The  aggregate  subsystem  is  the  model  each  subsystem  needs  to  compute  its 
aggregate  control.  The  control  law  in  (7.3.4)  identifies  the  information 
exchange  necessary  to  implement  the  scheme. 

The  residual  control  is  given  by 
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The  component  of  the  residual  control  that  shapes  the  dynamics  of  the 
residual  subsystem  is  again  decentralized.  The  feedforward  term  of  the 
aggregate  variables  can  be  decentralized  or  not  according  to  what  information 
is  locally  available. 

7.3.2.  Two  area  power  system 

In  Section  5.4  we  considered  the  decomposition  of  a  two  area  power 
system.  The  final  decomposition  is  given  in  (5.4.11).  A  quick  comparison 
with  (7.3.1)  shows  that  this  power  system  exhibits  the  structure  we  have 
been  discussion  in  this  section. 

The  application  of  the  TCCD  to  (5.4.11)  is  straight  forward.  Each 


area  computes  its  own  decoupling  control  as 


a 

P 


I 
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a1  -  K^x*  +  v1  (7.3.6a) 

kJj  -  'f2j/821  ±-U2  j  *  3, ... ,8.  (7.3.6b) 


That  is  to  say,  kJ  is  a  1x6  matrix  with  elements  given  by  (7.3.6b).  Thi9 
eliminates  the  aggregate  systems  dependence  of  the  residual  states.  Now  an 
aggregate  control  can  be  designed  to  regulate  the  power  flow  and  the  frequency 
deviations. 

The  last  step  is  control  of  the  residual  systems  in  (5.4.11b) 
using  u*.  This  is  a  standard  design  procedure  once  the  aggregate  control  is 
known . 
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7.4,  Optimal  Control  “1 

7.4.1.  Decomposition 

The  TCCD  can  be  applied  in  linear  quadratic  optimal  control 
problem  to  obtain  a  suboptimal  control  [2],  To  see  how  this  goes,  suppose 
that  the  system  is  represented  as  in  (7.2.1).  With  respect  to  this  basis,  let 
the  cost  function  be  given  by 


(7.4.1) 


with  all  the  usual  assumptions.  The  optimal  strategy  u*=K*x'  would  minimize 
J  for  all  initial  conditions. 


I 


We  will  consider  a  suboptimal  solution  based  on  the  TCCD. 
Parameterize  the  total  control  on  the  decoupling  component.  Write 


(7.4.2) 


Substituting  (7.4.2)  into  (7.4.1),  we  obtain  a.  cost  function  parameterized 
on  K^,  which  we  emphasize  by  writing  which  decomposes  as 

J(Kd  )  -  JA  +  JR  (7.4.3) 


where 
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(7.4.4a) 
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•u  R3u  dt. 


(7.4.4b) 


Since  the  decoupling  control  makes  the  system  unobservable,  the 
original  optimal  control  problem  decomposes  into  two  subproblems.  The 
first  is  defined  on  the  aggregate  subsystem  with  cost  function  J  (7.4.  i) . 
This  involves  the  solution  of  a  Riccati  equation  of  order  r.  Once  the 
aggregate  control  has  been  determined,  the  system  (7.2.1a)  becomes 
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(7.4.5) 


The  residual  control  is  now  computed  using  the  performance  index  (7.4.4b) 
in  conjunction  with  the  system  constraint  (7.4.5). 

This  scheme  is  suboptimal  because  it  is  parameterized  on  the 


decoupling  control  K_.  It  may  be  asked  when  this  solution  approaches  an 


optimal  solution.  We  make  the  following  observation.  Suppose  that 


Q^*  Q.j*  0  in  (7.4.1),  that  is,  only  the  outputs  are  penalized  and 
the  control  weighting  is  block  diagonal.  If  the  residual  subsystem  was 
unobservable  in  the  sense  that  C  *0,  the  optimum  solution  would  be  given 

K 

by  the  aggregate  solution.  Next  observe  that  the  decoupling  control  is 

-►0  as  C^-*-0.  Further, 

from  (74. 4. b)  we  have 


directly  proportional  to  CR  (see  (7.2.4)),  so  that 


h  /  uTR  u  dt 
0  J 


as 


*D 


(7.4.6) 


If  the  residual  subsystem  is  stable,  we  have  u->-0.  In  summary,  if  the  output 
equation  (7.2.1b)  is  derived  from  the  state  weighting  matrix  in  (7.4.1)  (Q=C  C) 
then  the  suboptimal  control  via  the  TCCD  approaches  the  optimal  control  as 
the  system  becomes  unobservable. 


7.4.2.  A  Pareto  game 

This  optimal  control  approach  to  the  TCCD  when  combined  with  the 
decentralized  control  of  Section  7.3,  has  applications  in  game  theory [2,55] . 
As  an  example,  consider  the  following  Pareto  game.  Assume  we  have  the 
system  structure  of  (7.3.1)  such  that  each  subsystem  (7.3.2)  has  associated 
with  it  a  performance  index  of  the  form  (7.4.1).  Each  player  (subsystem) 
is  to  choose  an  optimal  strategy  u*  *  K*x*  such  that  any  deviation  from  the 
optimal  strategy  causes  an  increase  in  one  of  the  cost  functions;  i.e., 
there  does  not  exist  a  strategy  pair  (K°,K°)  such  that 


with  strict  inequality  holding  for  some  i.  Thus  the  two  players  cooperate  to 
minimize  both  cost  functions. 


As  outlined  above,  each  player  selects  his  decoupling  control  and 
applies  it  locally.  Then  the  choice  of  the  aggregate  control,  the  coordinator 
problem  in  the  terminology  of  Section  7.3,  becomes  a  reduced  order  Pareto 
game  in  which  each  subsystem  has  a  cost  function  of  the  form  (7.4.4a).  The 


structure  of  the  aggregate  control  is  given  in  (7.3.4)  where  ■  ^A12*^ 


is  a  structural  constraint. 


Once  the  coordinator  problem  is  solved,  we  can  solve  the  residual 
problem.  Note,  however,  that  the  residuals  subsystems  in  (7.3.1)  are 
mutually  uncontrollable,  and  because  of  the  enforced  decentralization  of  the 
control,  this  structure  is  preserved.  Thus,  the  Pareto  game  defined  on  the 


residual  subsystems  in  (7.3.1)  decomposes  into  two  local  control  problems 


with,  say  for  i*l,  cost  function  (7.4.4b)  subject  to 


(7.4.8) 


Note  that  there  is  a  computational  savings  since  the  states  of  the  other 
residual  are  not  included  in  (7.4.8).  This  savings  may  be  considerable  if 
there  are  more  than  two  players. 


Output  Decouplins 


Having  introduced  the  TCCD  in  Section  7.2,  we  showed  how  it  could 


be  used  in  decentralized  control  (Section  7.3)  and  optimal  control 
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(Section  7.4).  He  mention  again  that  these  results  depend  on  the  system 
decomposition.  In  this  section  we  will  use  these  two  ideas,  decomposition 
and  the  TCCD,  in  a  slightly  different  way  in  a  standard  compensation  problem. 

By  output  decoupling  we  mean  computing  a  static  output  feedback 
control  law 


u  ■  Ky  +  Vv 


(7.5.1) 


such  that  in  the  closed  loop  system,  the  i-th  input  vector  controls  only 
the  i-th  output  vector.  This  implies  that  the  closed  loop  system  forms  a 
set  of  decoupled  subsystems.  That  is  to  say,  the  open  loop  system  consisted 
of  a  set  of  interconnected  systems  which  we  have  decoupled  by  the  feedback 
law  (7.5.1).  From  this  point  of  view,  the  main  problem  is  to  identify  the 
open  loop  interconnected  system  structure.  In  this  section,  we  shall 
apply  the  results  of  Chapter  5  to  solve  this  problem.  This  will  illustrate 
how  the  information  and  control  structure  can  be  used  in  compensator  design. 

The  problem  of  output  decoupling  has  been  of  long-suanding 
interest  [77-82].  Results  have  been  obtained  in  the  frequency  domain  [77] 
and  in  the  time  domain  from  both  geometric  [3,80,82]  and  matrix  [  78,79,81] 
analysis.  The  main  emphasis  here  is  on  the  approach  to  the  problem  anc 
the  corresponding  insight  obtained  into  the  structure  of  the  solution.. 

This  problem  illustrates  the  use  of  the  more  general  concepts  developed 
above. 

Given  the  system  (C.1)-(C.2),  suppose  that  the  output  vector  has 

1  2 

been  partitioned  into  two  subvectors  y  and  y  ,  i.e.,  (C.2)  becomes 


WL*J- 


(7.5.2) 


Now  we  want  to  decompose  the  system  into  two  subsystems  with  (7.5.2)  as 
their  respective  outputs.  To  this  end  apply  one  step  of  chained  aggregation 
to  obtain 


(7.5.3a) 


(7.5.3b) 


To  obtain  the  representation  in  (7.5.3b),  the  row  spaces  of  C1 

2 

and  C  in  (7.5.2)  must  be  independent.  This  represents  a  noninteraction  in 
the  information  structure.  Intuitively,  if  the  row  spaces  were  not  indepen¬ 
dent,  then  the  same  measurement  would  occur  in  both  outputs  and  they  could 
not  be  independently  controlled. 

Next,  to  identify  the  input  structure  overlap  we  apply  the  trans¬ 
formations  of  MCA.  In  addition  we  require  that  V  satisfy 
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(7.5.4) 


That  is  to  say,  the  input  transformation  V  not  only  identifies  the  null 
space  of  the  aggregate  input  matrix,  but  also  block  doagonalizes  it  in 
accordance  with  the  output  partitioning.  This  requires  that  the  input 


decompose  with  respect  to  the  given  output  structure.  This  is  a  necessary 
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condition.  Intuitively,  if  (7.5.4)  does  not  hold,  the  same  input  affects 
both  outputs. 

To  see  the  role  of  feedback  in  this  setting,  suppose  that  C  in 
(7.5.2)  is  monic  (or  the  outputs  are  "complete"  [3  ]).  In  that  case  the 
dimension  of  the  residual  state  space  in  (7.5.3)  is  zero.  Assuming  that 
(7.5.4)  holds,  (7.5.3)  reduces  to 


y  _  AG1  AG12  y  +  BGr  °  U 
y  .  _AGG2  AG2  .  ,y  .  -  °  ®G2.  . 


(7.5.5) 


To  decouple  the  system  (7.5.5)  we  would  like  to  cancel  A^,^  and  A^^*  This 
is  possible  if  and  only  if 


«tAG12]  Cfl[BGl] 


flIAG21]  c 


(7.5.6) 


Hence,  (7.5.6)  is  also  a  necessary  condition  for  decoupling. 


If  (7.5.6)  is  satisfied,  then  let 


AGi , 3-i  "  BGiNil 


i-  1,2. 


(7.5.7) 


In  this  form  it  is  obvious  that  (7.5.5)  is  two  interconnected  subsystems 
as  described  in  Example  5.4.4.  To  decouple  these  two  subsystems,  we  apply 
feedback 


•  ••  w  a 

U  -  -Nuy  . 


The  total  feedback  matrix  becomes 


(7.5.3) 


where  and  are  chosen  to  place  the  poles  of  the  subsystems.  Note 
that  the  feedback  matrix  has  two  distinct  functions;  i)  N^.  and  modify 
the  information  structure  of  the  system  and  li)  and  K „  change  the 
dynamics  of  the  subsystems. 

We  also  see  that  the  feedback  in  (7.5.9)  has  the  structure  of  the 

TCCD.  Consider  only  the  first  output,  y\  in  (7.5.5).  Then  the  off  diagonal 

block  in  (7.5.9),  i.e.,  (7.5.8)  with  i-1,  is  the  decoupling  control.  The 

aggregate  control  is  defined  by  and  the  residual  control  by  K^*  ^ 

2 

similar  interpretation  is  obtained  by  considering  the  second  output,  y  . 

Thus. far  we  have  identified  the  three  essential  components  of  the 
decoupling  problem.  First,  the  information  structure  must  be  nonoverlapping 
(existence  of  (7.5.3b)).  Secondly,  the  control  must  decompose  with  respect 
to  this  information  structure  (existence  of  (7.5.4)).  These  two  conditions 
establish  well  defined  subsystems  of  the  original  system.  Thirdly,  the 
interconnection  structure  must  be  of  special  form  (7.5.6).  This  guarantees 
the  existence  of  a  decoupling  feedback.  It  also  establishes  the  existence 
of  a  particular  interconnection  structure  between  the  subsystems.  We  shall 
see  that  these  three  conditions  reoccur  in  the  general  case. 

Now  suppose  that  the  dimension  of  the  residual  system  in  (7.5.3) 
is  not  zero  and  that  (7.5.4)  and  (7.5.6)  hold  (the  aggregate  subsystem 


decouples).  We  can  describe  the  residual  by  the  quadruple  . 

Here  we  can  think  of  B  representing  an  "active"  input  (one  available  for 

K 


control)  and  B 


GR 


a  "passive"  input. 


t 


With  this  distinction,  the  problem  of  decoupling  the  residual  is 


similar  to  decoupling  the  aggregate.  First  note  that  C_  cannot  be  modified 

K 

using  output  feedback.  So  we  simply  apply  the  procedure  described  above  to 

the  residual.  Two  cases  occur  as  BR  is  zero  or  not.  First  suppose  Brj*0. 

Then  the  second  step  of  chained  aggregation  is  applied  to  CR  and  the  input 

transformation  is  applied  to  B  .  Suppose  that  CL  is  monic  and  that  following 

R  R 

the  transformation  the  residual  system  has  the  representation 
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(7.5.10) 
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n  [Et]  -  0. 


To  obtain  (7.5.10),  it  is  first  necessary  that  the  rows  of  C 


R1 


are  independent  of  the  rows  of  C^.  Thus  the  information  structure  must 


continue  to  be  nonoverlapping.  Secondly,  B  must  be  block  diagonalizable; 

K 


i.e.,  the  input  structure  must  continue  to  decompose  according  to  the  informa¬ 
tion  structure. 

Now  to  decouple  the  system  in  (7.5.10),  we  would  like  to  cancel 
^l9’  ^21'  BGR12’  and  BGR21'  However*  the  residual  states  are  not  avail¬ 


able  for  feedback.  So  it  must  be  structurally  true  that  ~  ®  an^ 


The  outputs  are  available  for  feedback  giving  the  decoupling 

conditions 


«[Br,  ,  J  C  »?[B  ], 


i-  1,2 


(7.5.11) 


BGi,3-i  “  BRiNi2 * 


(7.5.12) 


The  relationship  In  (7.5.12)  again  defines  the  feedback  necessary  to 
decouple  the  system.  It  also  leads  to  the  subsystem  interconnection  structure 
in  Example  5.4.4. 

The  composite  feedback  matrix  continues  to  decompose  as  In  (7.5.9) 
into  two  distinct  control  functions.  Part  of  the  control  is  used  to 
(informationally)  decouple  the  subsystems.  The  remaining  freedom  in  the 
control  places  the  poles  of  the  decoupled  subsystems.  This  is  an  output 
pole  placement  problem. 

If  B  =0,  then  B_  plays  the  role  of  the  input  matrix,  i.e.,  the 

K  CjK 

input  transformation  is  defined  with  respect  to  B__.  Also  note  there  is 
no  possibility  for  feedback.  So  if  the  system  is  to  decouple  the  corre-^ 
sponding  matrices  must  be  structurally  block  diagonal.  This  again  leads 
to  the  subsystem  interconnection  structure  in  Example  5.4.4. 

If  CD  in  (7.5.3)  is  not  monic,  then  (7.5.10)  actually  will  split 
into  an  aggregate  and  a  residual.  The  above  analysis  is  then  applied  to 
the  aggregate.  If  all  the  conditions  hold,  the  analysis  is  repeated  for 
the  residual.  This  process  continues  until  the  system  is  decoupled. 

The  main  idea  behind  this  approach  is  to  untangle  the  information 
and  control  structures.  The  system  can  be  decoupled  when  these  structures 
decompose  in  the  proper  way.  This  leads  to  a  special  representation  of  the 
system  as  interconnected  subsystems.  These  results  are  summarized  in  the 


following. 
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Proposition  7. 5. It  [90]  The  system  (C.1)--(C.2)  can  be  decoupled  by  output  feed' 
back  (7.5.1)  If  and  only  If  It  can  be  represented  as  an  Input-output  Inter¬ 
connected  system  with  respect  to  the  given  output  partition.  □ 

The  above  analysis  not  only  tells  us  when  decoupling  is  possible, 
but  also  identifies  the  feedback  structure  which  accomplishes  decoupling. 

This  structure  includes  the  part  which  specifically  decouples  the  system  and 
the  remaining  freedom  to  place  the  closed  loop  poles .  Thus  this  analysis 
nicely  illustrates  the  close  Interrelationship  between  the  information  and 
control  structure  and  feedback  design. 

7.6.  Nonlinear  Systems 

7.6.1.  Preliminaries 

Consider  the  nonlinear  system 

x(t)  ■  Ax(c)  +  Bu(t)  +  Df(y(t))  x(0),  t>0  (7.6.1a) 

y(t)  -  Cx(t)  (7.6.1b) 

where  x6Rn,  u€  Rra,  and  y€  Rr.  The  nonlinearities  are  represented  by  the 
function  f  :  Rr-^R^  with  f(0)  *0.  Furthermore,  (A,B,C,D)  are  real  constant 
matrices. 

In  this  section  we  will  discuss  a  (partial)  feedback  invariant 
structure  based  on  linear  transformations  of  the  state  and  input  spaces  and 
linear  state  feedback.  This  is  motivated  by  the  desire  to  study  the  effect 
of  nonlinearities  on  linear  feedback  design.  To  do  this  we  start  with  a 
review  of  the  invariant  structure  of  the  linear  part  of  (7.6.1),  i.e.. 


x  -  Ax  +  Bu 


(7.6.2) 


! 

< 

L1 


' 


y  -  Cx. 


A  complete  set  of  Invariants  for  (7.6.2)  under  state  and  input  space  trans¬ 
formations  and  state  feedback  is  not  known.  However,  a  partial  list  is 
known  [44].  To  exhibit  this  invariant  structure,  we  use  MCA.  After  trans¬ 
formation  (7.6.2)  can  be  represented  as 


(7.6.3) 


where  C  ^Gll^  and7[[G^]*0.  Hence  linear  state  feedback  exists 

which  cancels  F^. 

In  Proposition  4.2.4  it  is  shown  that 


(7.6.4) 


and  in  Proposition  4.2.5  that 

«*  -  <F22|G22>.  (7.6.5) 

This  identifies  the  uncontrollable  modes  of  (^22,G22^  aS  C^e  invarianc  2eros 
(Proposition  4.2.6). 

The  uniqueness  of  Kf*  and  fi*  [  3  ]  identifies  the  pair  (F?7,G2n)  as 
related  the  the  invariant  structure  of  (7.6.2).  Indeed,  a  partial  list  of 


invariants  for  (7.6.2)  is  given  by  the  list  of  invariants  for  (F77,G77) 
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under  state  and  input  space  transformations  and  state  feedback.  These 
invariants  are  well  known  [  3 ]  and  consist  of  the  contrillability  indices 
of  (^22’^22^  P^us  t*le  eigenvalues  of  the  uncontrollable  modes.  As  just 
pointed  out,  these  eigenvalues  are  the  invariant  zeros  of  (7.6.2).  This 
is  the  invariant  structure  we  seek. 


7.6.2.  Invariant  structure 

We  will  extend  the  partial  invariant  structure  to  the  nonlinear 
system  (7.6.1).  Note  that  the  nonlinearities  depend  on  only  some  of  the 
state  variables.  (In  fact,  we  have  in  mind  systems  in  which  r  is  signifi¬ 
cantly  less  than  n.)  In  this  section  we  will  interpret  C  in  (7.6.1b)  as 
an  artificially  introduced  map  which  explicitly  identifies  the  dependent 
variables  in  the  nonlinearities.  In  this  way  we  identify  the  effect  of  the 
nonlinearities  on  the  linear  part  of  the  system. 

We  can  then  extend  the  analysis  of  the  last  section  directly  to 
the  nonlinear  system  (7.6.1).  By  defining  the  state  and  input  space 
transformations  with  respect  to  the  linear  part  of  (7.6.1),  this  system 
can  be  represented  as  (7.6.1) 


y  ■ 


(7.6.6) 


We  can  use  linear  state  feedback  to  cancel  F 


12  ’ 


Then  (7.6.6)  decomposes 


>1 


as  a  nonlinear  system 


driving  a  linear  system 


*r  "  F22xr  +  G22“  +  F21F  +  H2f^*  (7.6.8) 

Thus  the  invariants  of  the  linear  system  (7.6.8),  i.e.,  of  (^22,G22^  Form  a 
partial  listing  of  the  invariants  of  the  complete  system.  The  other  set  of 
invariants  are  then  given  by  the  nonlinear  system  (7.6.3),  but  they  are  not 
known  at  this  time. 

The  geometric  structure  of  the  linear  system  also  carries  over  to 
the  nonlinear  system.  The  idea  of  (A, B) -invariant  subsystems  has  recently 
been  extended  to  nonlinear  systems  [91].  Using  the  ideas  there,  it  is 
easy  to  show  that 


sp 


(7.6.9) 


is  an  (A, B) -invariant  manifold  for  the  system  (7.6.2). 


7.6.3.  Control  synthesis 

Clearly,  the  TCCD  is  useful  here.  To  apply  the  TCCD  to  a  nonlinear 
system,  we  first  transform  (7.6.1)  into  the  representation  (7.6.6).  This 
guarantees  the  existence  of  a  linear  (partial)  state  feedback 

u  =  +  v  (7.6.10) 

where  K_  is  chosen  such  that 


Fn  +  GiA  '  °- 


(7.6.11) 


i.e.,  the  decoupling  control.  This  yields  a  reduced  order  model  of  the 
output  variables  y  given  by  the  aggregate  subsystem 


*  ■  F11F  +  Gu5  +  Hif(y) 

y  -  Cxy. 


(7.6.12) 


Note  that  the  states  do  not  enter  into  (7.6.12)  so  that  a  control  can  be 
designed  for  the  variables  y  using  this  reduced  order  model.  The  variables 
xf  are  modeled  by  the  residual  subsystem 

*r  *  F22Xr  +  G22U  +  F21y  +  H2f(y)‘  (7.6.13) 

Once  the  (possibly  nonlinear)  aggregate  control  v=k(y)  has  been  specified 
for  the  system  (7.6.12),  we  can  design  a  residual  control  for  (7.6.13). 

This  synthesis  procedure  has  several  advantages.  First,  the 
decoupling  control  is  easily  computed  since  it  is  linear.  Second,  the 
truly  nonlinear  aspect  of  the  system  (7.6.1)  is  contained  in  the  aggregate 
subsystem.  Thus,  we  have  reduced  the  order  of  the  nonlinear  design  and 
isolated  its  effect  on  the  system.  While  a  control  synthesis  for  this 
subsystem  is  not,  in  general,  easy,  it  has  been  simplified  from  the  original 
problem.  Also  note  that  this  component  of  the  control  can  be  linear  or 
nonlinear  as  we  like.  Thirdly,  the  residual  subsystem  in  (7.6.13)  is  a 
linear  system.  Thus  the  control  of  this  system  can  be  carried  out  using 
any  one  of  a  number  of  standard  techniques. 

As  for  linear  systems,  this  procedure  does  not  restrict  the  control 
design  for  the  aggregate  subsystems,  but  it  imposes  the  invariant  structure 
on  the  residual  subsystem.  This  may  impose  an  unacceptable  desing  constraint; 
for  instance,  if  one  of  the  residuals  fixed  modes  is  unstable.  However,  by 


modifying  Che  aggregate  control  (7.6.10),  we  can  include  the  unstable  modes 
in  the  aggregate  system.  This  procedure  is  the  same  as  the  linear  case. 

See  Section  7.2.4. 


7.6.4.  Example 

In  this  section  we  will  illustrate  the  ideas  above  by  considering 
the  aircraft  landing  problem  descirbed  in  Dyer  and  McReynolds  [92].  The 
problem  is  to  design  an  automatic  landing  control  for  a  heavy  transport 
aircraft  during  the  final  landing  stage,  the  flareout  phase.  Typical  state 
equations  are 

x,  -  0.41x,  +  0.381x„  -  0.562x,  -  2.522x  +0.221f(xc) 

1  1  2  3  y  5 

x^  *  -0.066x^-  0.577x2  +  x^  -  0.05x^  -  0.992x2  -  0.395f(x^) 

x3  -  0.01^-  1.108x2-  0.822x3-  1.264x6-0.157x7-  3.544f(x5)  (7.6.14) 

x4  *  x3 

x_  *  -12.147  +  4.049 (x  -xn) 

J  4  2 


x7  *  (u2-x7>/2 

where  the  seven  elements  of  the  state  vector  x  are  as  follows: 

x^  =  increment  of  forward  speed  (ft/sec) 
x2  ■  increment  of  altitude  ange  (deg) 
x3  =  increment  of  pitch  rate  (deg/sec) 
x^  =  increment  of  pitch  angel  (deg) 
x.  =  height  (ft) 

x,  =  increment  of  elevator  angle  (deg) 


*7 


increment  of  throttle 


u^  -  increment  of  elevator  rate  (deg/sec) 
i>2  ■  input  to  throttle  actuator 
and  f,  the  nonlinear  ground  effect  term,  is 

f  -  400/(3x5+  100)-1.  (7.6.15) 


Ue  want  to  select  controls  u^  and  u^  such  that  x^  (the  forward  speed)  and 
x^  (the  height)  follow  a  prescribed  trajectory  while  x^  (the  pitch  angle) 
should  have  a  final  value  which  is  positive.  We  shall  take  the  simple  point 
of  view  that  x^,  x^,  and  x^  are  the  variables  of  primary  interest.  A 
complete  description  of  this  problem  can  be  found  in  [92]. 

We  shall  use  the  TCCD  to  decompose  this  problem  into  its  linear 
and  nonlinear  subsystems.  Equation  (7.6.14)  shows  that  the  nonlinearity 
depends  on  the  single  state  variable  x^.  We  formally  identify  this  structure 
by  introducing  an  output  equation 


y-  [0000  10  0]x.  (7.6.16) 

Combine  (7.6.16)  with  (7.6.14)  and  then  use  MCA.  The  resulting  decomposition 
is 
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(7.6.18) 


-x,  -  0.066x.  -  0.5777x.  -  -0.05x, 
7  1  4  € 


.lt^  +  u2. 


The  partitioning  in  (7.6.17)  corresponds  to  the  partitioning  in  (7.6.6). 

It  is  interesting  to  note  that  the  controls  seem  to  decompose 
naturally  with  respect  to  the  state  decomposition  in  (7.6.17)  in  the  sense 


that  affects  only  the  aggregate  subsystem,  and  the  first  control's  effect 


on  the  aggregate  is  an  order  of  magnitude  less  than  u^. 


With  this  in  mind,  the  first  step  is  to  compute  a  decoupling 


control  u,,  = 


Xr  +  V2 


-  (-.0826  -.8204  -1.0812  -.0666]. 


(7.6.19) 


When  this  control  is  applied  to  (7.6.17),  we  obtain  the  aggregate  subsystem 
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This  subsystem,  which  represents  the  nonlinear  part  of  the  model,  forms  a 
reduced  order  model  for  the  height  variable  y^  (»x^).  Thus  we  can  design 
a  control  v2*L(y)  so  that  the  aircraft  follows  in  the  prescribed  height 
trajectory. 

Following  computation  of  the  aggregate  control,  we  can  design  the 
residual  system  control  using  the  model 
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Since  this  system  is  controllable,  we  can  select  the  control  to  meet  the 
specifications  for  forward  speed  (x^)  and  pitch  angel  (x^). 


y 


I 


* 


The  residual  subsystem  (7.6.21)  also  contains  the  partial 
invariant  structure.  In  this  case,  the  linear  part  of  (7.6.21)  is  a  single 
input  controllable  pair.  Thus,  the  invariant  is  the  dimension  of  the  state 
space,  i.e.,  4.  Here,  the  invariant  structure  Imposes  no  design  limitations 
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CHAPTER  3 

DYNAMIC  COMPENSATION 

8.1.  Introduction  r 

In  the  last  chapter  we  discussed  the  role  of  the  GHR  and  the 
associated  geometry  In  closed  loop  design.  The  result  was  a  procedure, 
called  the  TCCD,  which  produced  a  state  feedback  matrix.  In  fact,  the  'm 

decoupling  control,  a  key  component  in  this  procedure,  depends  only  on 
the  residual  states  and  not  on  the  outputs.  If  only  the  outputs  are  avail¬ 
able  for  measurement,  the  implementation  of  the  control  requires  dynamic  1 

compensation,  i.e.,  an  observer. 

Since  chained  aggregation  is  based  on  observability,  it  is  not 

$r 

surprising  that  the  GHR  is  useful  for  observer  design.  We  shall  see  that 
the  important  subspaces  here  are  not  £^,  which  are  connected  to  the  input, 
but  the  (C, A) -invariant  subspaces  described  in  Section  4.3.  While  the 
theory  behind  observers  is  well  known  [93],  the  GHR  provides  a  particularly 
simple  exposition  of  the  subject  [94].  The  presentation  here  should  also  be 
considered  in  the  light  of  the  many  other  insights  the  GHR  provides. 

Because  this  observer  construction  is  based  on  the  same  ideas  as 
the  TCCD,  it  has  applications  in  the  other  topics  discussed  in  Chapter  7. 

In  particular,  we  discuss  a  decentralized  dynamic  compensation  scheme  for 
interconnected  systems  and  an  observer  for  the  class  of  nonlinear  systems 
of  Section  7.6. 

Finallv,  we  discuss  the  integration  of  the  ideas  of  near  unobserv- 
ability  and  the  TCCD.  It  seems  clear  that  the  GHR  provides  a  framework 
for  the  introduction  of  the  topological  notions  of  near  unobservability  into 
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compensator  design.  The  benefits  of  such  a  theory  are  equally  obvious. 
Therefore,  we  briefly  discuss  one  possible  implication  of  this  theory.  This 
material  should  be  considered  as  an  outline  for  future  work. 

In  Section  8.2  we  discuss  observer  design  in  the  GHR  framework. 
Section  8.3  applies  these  ideas  to  interconnected  systems  and  nonlinear 
systems.  Section  8.4  uses  the  ideas  of  near  unobservability  to  discuss  static 
output  feedback  and  observer  design. 


8.2.  Observers 

8.2.1.  Residual  state  observers 

After  one  step  of  chained  aggregation,  let  (C.1)-(C.2)  be  repre 

sented  by 


y  -  [cx  0] 


With  respect  to  this  basis,  let  any  feedback  matrix  be 


u 


Let  the  compensator  have  the  structure 

x  *  Mx  +  Ry  +  Ov 
r  r 

u  *  Nx  +  Py  +  v 
r 


(8.2.2) 


(8.2.3) 


with  xr6R“  .  With  some  foresight  we  choose 

M  *  *  *1  +  hh 

L  ’  Wl1  -  (bgr  +  brli>cI1 

n-l2 

P"L1 

q-br. 


Then  the  closed  loop  system  becomes 


1 

>» 

o 

CR 

bgV 

'  y  ' 

'V 

• 

X 

r 

.  t 

®GR 

agr 

BRL2 

X 

r 

+ 

Br 

«  ’ 

*  i 

L  rJ 

„BGR 

0 

*R  . 

X 

L  rJ 

.br. 

(8. 


where  A  *A.+  B  L.  To  show  that  we  have  accomplished  the  desired  compen 
G  G  G  1 

sation,  introduce  the  state  space  transformation 


(8 


In  these  coordinates,  (8.2.5)  becomes 


In  (8.2.7)  the  compensator  states  can  be  Interpreted  as  error  states 


e-x^-x^,  which  are  governed  by  the  dynamics  of  the  residual.  If  these 
dynamics  are  sufficiently  fast  and  stable  (assume  this  for  the  moment),  then 
the  state  of  the  system  is  governed  by  the  desired  closed  loop  matrix.  If 
we  can  show  that  the  compensator  dynamics  can  be  chosen  arbitrarily,  then 
we  have  achieved  our  desired  design.  To  see  this  consider  (8.2.1). 

Introduce  the  state  space  transformation 


I  0 

X  I 


r  J 


(8.2.8) 


Substituting  (8.2.8)  into  (8.2.1) 


VCRX 


L  GR 


R 

VXCR. 


T 

+ 

’  bg  ' 

. 

z 

_br+xbg. 

(8.2.9) 


Note  that  this  transformation  preserves  the  Information  structuring  in 
(8.2.1)  for  any  X.  It  is  easily  seen  that  if  (8.2.1)  is  observable, 
then  so  is  the  pair  (A^,C^).  Hence,  the  poles  of  A^+XC^  can  be  placed 
arbitrarily  by  proper  selection  of  X.  In  particular,  we  can  choose  them  suf¬ 
ficiently  fast  and  stable.  Then  (8.2.9)  replaces  (8.2.1)  in  the  design 


scheme  above. 


The  transformation  in  (8.2.8)  was  used  in  the  study  of  (C,A)- 
invarlant  subspaces  in  Section  4.3.  In  fact,  this  process  of  selecting 
observer  poles  has  the  geometrical  interpretation  of  selecting  a  (C,A)- 
invarlant  subspace  to  have  the  basis 


(8.2.10) 


(see  Section  4.3).  Note  that  the  dynamics  of  the  observer  are  determined  by 
the  induced  map  of  A4KC  on  the  factor  space  Z  mod  J ^  where  K  makes  J  ^ 

(A+KC) -invariant  [95].  Thus  by  selecting  a  basis  which  displays  a  larger 
(C, A) -invariant  subspace,  we  can  construct  a  smaller  order  observer  to 
estimate  part  of  the  residual  (unmeasured)  states. 


8.2.2.  Partial  residual  state  observers 

The  discussion  of  (C, A) -invariant  subspaces  and  the  GHR  in 
Section  4.3  shows  how  part  of  the  residual  state  can  be  reconstructed.  The 
(C, A) -invariant  subspaces  for  (8.2.1)  take  the  where  af^  is  an 

A  -invariant  subspace.  (Recall  that  the  subspaces  J  are  dependent  ona'L..) 

R  i  X 

Define  a  state  space  transformation 


(8.2.11) 


such  that  in  these  new  coordinates,  (8.2.1)  has  the  form 


*GR1  hi  hl2 
BGR2  °  *R2 . 


2  h  + 

Ah. 


bri  u- 


(8.2.12) 


Hence,  T  simply  identifies  an  A^- invariant  subspace.  Mow  assume  the  feed¬ 


back  is  of  the  form 


u  »  L  y  +  L  %  +  v. 


(8.2.13) 


Again  we  use  the  compensator  structure  in  (8.2.3)  with  d(xr) * d(52) .  The 
parameters  in  (8.2.3)  are  given  by 


M  =  hi  +  BR2L2  *  ha 
1  *  (flGR2+BR2Ll)CI1  *  BGR2CI1 


N  -  L, 


(8.2.14) 


P  -  L. 


Q  "  BR2 ‘ 


The  closed  loop  system  becomes 


y  a, 


BGL2  1 


CR1  CR2  BGL2 


^1  Vl.2  BR1L2 


,  R1 

+  v. 


(8.2.15) 
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To  see  thac  we  have  achieved  the  desired  compensation ,•  again  we  introduce 
error  coordinates  as  in  (8.2.6)  where  replaces  x^  and  the  dimensions 
change  accordingly.  In  these  new  coordinates  (8.2.15)  becomes 
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(8.2.16) 


By  proper  selection  of  X  and  T  we  can  choose  the  observer  dynamics  and 
achieve  the  desired  closed  loop  compensation. 

The  question  now  is  how  to  use  this  flexibility.  This  shall  be 
dealt  with  in  later  sections. 


8.3.  Observers  and  the  TCCD 

8.3.1.  Interconnected  systems 

The  observer  construction  of  the  last  section  is  directly  related 
to  the  TCCD  because  we  started  with  a  common  system  representation. 
Therefore,  we  turn  to  the  question  of  decentralized  dynamic  system 
compensation  [94],  Consider  again  the  system  representation  introduced  in 
Section  7.3,  equation  (7.3.1).  We  will  assume  this  representation  was 
obtained  after  one  step  of  chained  aggregation.  Suppose  also  that  with 
respect  to  this  basis,  the  control  is  given  by  (7 . 3 . 3) - (7 . 3 . 5) . 

To  construct  an  observer  for  this  system,  simply  substitute  the 
system  parameters  in  (7.3.1)  into  the  observer  equations  (3 . 2 . 3)- (S  .  2 . 4) . 


i 
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However,  because  of  the  special  structure  of  (7.3.1),  the  observer  decom¬ 
poses  into  two  subsystems,  given  by 

xr  *  (ARi+BRiKRi)xr+  (BGRiKAi^  +  (BGRi,3-i+BARiKGRi,3-i^ 


+  [BARiBRi^V 


(8.3.1) 


-ii 
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-i 
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Ka. 

Al 

LKGRiJ 


y1* 
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Ai,3-i 
GRi , 3-i J 


-3-i  . 

y  + 


^i 

*RiJ 


x1  + 
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v1** 

v1 


i-  1,2. 


This  is  immediately  recognized  as  an  observer  for  each  of  the  subsystems 
(7.3.2).  Thus  this  approach  yields  a  partially  decentralized  dynamic 
compensator  such  that  each  of  the  local  subsystems  can  choose  its  own 
observer  dynamics  as  discussed  in  Section  8.2. 

Each  local  compensator  in  (8.3.1)  requires  knowledge  of  all  the 
interaction  variables  y,  but  not  of  the  other  subsystem's  residual  states. 
This,  design  framework  does  not  appear  to  relax  this  constraint  on  the 
information  exchange  except  in  the  special  case  when  *  BGR21  =  ® * 

However,  in  this  case  we  are  not  free  to  place  the  compensator  poles  (in 
general,  the  transformation  (8.2.8)  will  introduce  coupling  between  the 
compensators) .  If  it  turns  out  that  the  open  loop  residual  poles  are 
suitable  observer  poles,  then  we  can  use  this  compensation  scheme.  But  this 
implies  that  the  residual  variables  do  not  contribute  much  to  the  open  loop 
system,  and  suggests  that  the  original  model  can  be  reduced  to  the  aggregate 


subsystem  alone. 
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8.3.2.  A  nonlinear  observer 

It  should  be  clear  now  how  to  construct  observers  from  the  GHR. 

We  note  here  that  this  structure  can  also  be  extended  to  the  class  of 
nonlinear  systems  discussed  in  Section  7.6  [83].  Given  the  system  (7.6.1), 
suppose  that  y(t)  represents  the  measured  outputs.  After  one  step  of 
chained  aggregation  let  the  system  be  represented  as 


y 

lAj  L"  21 


'F  F 
11  12 


1 

^  1 

_ 1 

+ 

'V 

u  + 

‘V 

*r  J 
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L  u  J 

.°2. 

f(y) 


(8.3.2) 


r 


y  -  [cx 


o] 


r  J 


Then  using  the  observer  construction  of  Section  8.2,  the  states  x^  are 
estimated  by 

*r  =  F22*r  +  F21Clly  +  V(y)  +  V*  (8’3*3) 


This  is  a  linear  observer  whose  dynamics  can  be  selected  by  the  method  of 
Section  8.2.  This  seemingly  simple  construction  hides  the  fact  that  it  is 
difficult  to  build  an  observer  for  any  nonlinear  system. 


8.4.  Near  Unobservabilitv  in  Compensation 
8.4.1.  Output  feedback 

In  Section  8.2,  we  showed  how  to  construct  an  observer  which 
estimated  only  some  of  the  residual  states.  In  this  section  we  exploit 
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that  freedom  by  discussing  the  role  of  near  unobservability  in  compensation 
schemes.  The  key  idea  is  to  analyze  how  static  output  feedback  affects  the 
closed  loop  system  poles. 

We  shall  discuss  systems  represented  by  the  second  generic 
case  (Section  4.2.3).  After  one  step  of  chained  aggregation  the  system 
can  be  represented  as 


(8.4.1) 


Thus,  X (A^)  represents  the  invariant  zeros  of  (8.4.1).  Consider  output 
feedback  of  the  form 


u  *  Ly. 

Then  the  closed  loop  system  has  the  form 


(8.4.2) 


A1+B1L  A2' 

y" 

A-  A. 

X 

3  4  J 

L  rJ 

(8.4.3) 


Here  we  note  that  by  using  Theorem  3.2.3,  as  I  Lll  -*-00,  the  eigenvalues  of 
(8.4.3)  go  to  infinity  and  X(A^)  (cf.,  the  analysis  in  Section  3.5),  as 


is  well  known. 
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Suppose  we  introduce  the  orthogonal  transformation 


'*1  rpu  pi2] r y I _  _ r y ' 
Aj  lP21  P22 J  xr J  IV 


where  P  is  described  in  (3.2.7)  and  we  choose  P  such  that 


(8.4.4) 


(8.4.5) 


is  A-invariant.  Then  we  may  write  (8.4.3)  in  the  new  coordinates  as 


M  Jr  °]+rpnBi 

Xr-  }.A3  V  .P21B1 


FP11  P11B1FP21 

Al  P21BlFP21 


(8.4.6) 


where  we  have  separated  out  the  effect  of  the  feedback.  In  this  context 
we  can  think  of  the  feedback  as  a  perturbation  of  A.  Using  exactly  the 
same  techniques  as  we  used  for  near  unobservability,  Stewart  [26]  has 
given  perturbation  theorems  as  follows: 

Theorem  8.4  [26]:  Let  A  and  E  be  nxn  matrices  given  as 


A3  A4 


E1  E2 
E3  EA 


6  =  sep(A  ,A, )  -  IE.I  -  HE,  II 
1  <4  1  4 


I 


IE2KIA3I  +  IE2I)  x 

7%  -  4 


there  is  a  matrix  P ,  satisfying 


such  that 


2IE  I 

,P1«  S  i 


rwl,"'’] 


span  an  invariant  subspace  of  A+E.  Furthermore,  X(A+E)  are  given  as  the 
disjoint  union  of 


X (  (I+PjpJ)**  [A1+E1-P1  (A3+E3)  ]  (I+P^J)  1/2  ) 
x  (  (i+pJp1)  1/1  [  a4+e4+(a3+e3-  p1  ]  (  i+pJp1)  1/2 )  . 


We  would  like  to  apply  Theorem  8.4.1  to  (8.4.6)  when  the  original 
system  (8.4.1)  is  nearly  unobservable,  i.e.,  when  9 PD  defining  (8.4.4)  is 
small.  Recall  that 


Pu*  I,  PtJ  *0  as  P-0  U3 


(see  (3.2.7)).  Hence,  it  follows  from  (8.4.6)  that 


11B1LP11 

-  B  L 
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P 

-  0 

2l“lLP21 

♦  0 

as 

P 

-*■  0 

(8.4.7) 


(8.4.8) 


In  fact,  E4  tends  to  zero  quadratically!  Assuming  Theorem  8.4.1  holds,  it 


follows  that  the  closed  loop  eigenvalues  tend  to  X(A  +B  L)  and  X(A  ). 


The  point  of  this  analysis  is  to  describe  the  quantitative  effect 
of  static  output  feedback  on  the  two  sets  of  open  loop  poles  identified 
in  (8.4.6)  for  nearly  unobservable  systems.  Applying  Theorem  8.4.1  to 
(S.4.6),  we  see  that  the  poles  are  modified  by  1)  an  additive  correction 
term  whose  effect  is  described  in  (8.4.8),  and  2)  a  term  in  P^  with  the 
structure  (A^+E^P^.  Now  contains  the  output  feedback  contribution  which 
is  linear  in  P.  The  other  parameter  here  is  A^.  If  A^  is  large  then  the 
output  feedback  (8.4.2)  is  expected  to  have  a  large  effect  on  the  eigenvalues 
of  A^.  On  the  other  hand,  if  A^  is  small,  then  so  is  A^  (since  the  system 
is  assumed  to  be  nearly  unobservable).  This  has  two  interpretations.  By 
Section  3.4,  this  implies  the  open  loop  system  is  weakly  observable  if 
there  is  an  eigenvalue  separation.  Secondly,  by  Section  4.4  there  is  almost 
a  pole-zero  cancallation.  In  this  case  it  is  well  known  that  large  control 
energy  is  needed  to  move  the  open  loop  poles. 

The  above  analysis  easily  extends  to  any  i-th  unobservable  subspace 
£±.  Simply  redefine  the  feedback  matrix  partitioning  in  (8.4.6)  along  with 
P  and  P^. 

8.4.2.  Observer  Design 

From  the  analysis  of  near  unobservability,  if  £^  is  nearly 
unobservable,  it  is  near  a  A-invariant  subspace.  Denote  the  eigenvalues 
of  this  subspace  by  A2  and  the  rest  of  the  eigenvalues  of  A  by  A  .  From  the 
analysis  above,  we  see  that  static  output  feedback  strongly  affects  A  but 
not  i\^.  There  are  two  evaluations  of  •  First,  since  these  eigenvalues 
contribute  little  to  the  system  (they  are  nearly  unobservable)  and  they 
are  not  affected  much  by  feedback  to  the  other  modes,  we  can  safely  ignore 


them.  That  is,  we  do  a  nodal  reduction,  as  described  in  Section  6.3,  and 
apply  the  design  techniques  above  to  it. 


On  the  other  hand,  suppose  the  nodes  associated  with  are 
Important  and  cannot  be  ignored.  Since  to  move  them  would  require  large 
static  gains,  we  must  use  a  dynamic  compensator.  The  modes  associated  with 
A^,  on  the  other  hand,  are  heavily  influenced  by  static  feedback.  This 
suggests  that  we  build  an  observer  that  estimates  only  the  nearly  unobserv¬ 
able  modes. 

Both  of  these  points  of  view  fits  into  the  observer  framework  of 
Section  8.2.  Indeed  it  tells  us  how  to  select  the  transformation  T  in 
(8.2.11)  to  obtain  the  observer  parameters  in  (8.2.14).  If  the  nearly 
unobservable  states  are  to  be  ignored,  in  (8.2.12)  we  select  MA^)  - 
On  the  other  hand,  if  the  nearly  unobservable  modes  are  to  be  measures,  we 
have  MA^)  *  A2* 
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CONCLUSION 

This  thesis  has  preseated  a  detailed  study  of  the  GHR  and  chained 
aggregation  with  applications  to  standard  problems  in  linear  system  theory, 
such  as  model  reduction  and  compensator  design.  In  retrospect,  the  work 
should  be  viewed  as  a  whole.  Then  it  becomes  clear  that  the  GHR  is  a 
unifying  framework  in  which  the  fundamental  structure  of  linear  systems  is 
exposed.  Thus,  it  is  shown  that  the  GHR  is  useful  in  many  problems  and  it 
is  believed  it  will  prove  useful  in  many  more. 

The  theoretical  background  for  this  success  is  the  twofold  inter¬ 
pretation  of  the  GHR  in  both  geometric  and  matrix  terms.  This  allows 
fundamental  properties  to  be  stated  concretely  while  still  retaining  their 
abstract  nature.  For  example,  the  fundamental  result  is  that  the  GHR 
explicitly  identifies  the  i-th  unobservabllity  subspaces  {jC^}.  The  matrix 
representation  is  useful  because  it  allows  us  to  estimate  distances 
between  subspaces  and  so  introduce  near  unobservability.  The  fact  that  this 
is  a  useful  topology  may  be  attributed  to  the  GHR.  The  abstract  charac¬ 
terization  of  the  GHR  is  useful  because  it  connects  the  open  loop  {£^} 
subspaces  to  their  closed  loop  counterparts,  (A, B) -invariant  subspaces  and 
so  with  the  geometric  literature.  In  this  way  we  are  given  fundamental 
interpretations  of  the  design  procedures  which  result  from  the  GHR. 

The  analysis  of  the  GHR  concentrates  in  three  areas.  The  first 
is  the  topological  characterization  of  the  subspaces  {£  ^ }  and  invariant 
spaces,  i.e.,  near  unobservability.  The  second  area  is  the  behavior  of  {£^} 
under  the  action  of  the  input.  The  third  area  is  the  system  decomposition 


induced  by  the  GHR.  These  are  treated,  respectively,  in  Chapters  3,  4, 
and  5. 

Even  though  they  are  given  independent  treatment,  these  topics 
are  intimately  related  through  the  GHR.  In  fact,  combinations  of  these 
ideas  lend  insight  into  well-known  problems.  In  Chapter  6  model  reduction  is 
discussed  by  combining  system  decomposition  with  near  unobservability. 

Hence,  we  are  able  to  clarify  the  GHR  in  model  reduction,  the  original  use 
of  the  GHR.  By  combining  the  system  decomposition  with  the  closed  loop 
subspaces,  insight  is  gained  into  a  recent  control  design  procedure,  the  TCCD 
In  fact,  we  are  able  to  connect  it  with  geometric  design  procedures  and 
extend  the  TCCD  to  various  types  of  interconnected  systems,  optimal  control, 
output  decoupling,  nonlinear  systems,  and  observer  design.  Because  of  the 
recent  introduction  of  near  unobservability,  the  integration  of  all  three 
aspects  of  the  GHR  is  still  in  preliminary  stages.  However,  this  direction 
of  research  shows  great  promise.  A  preliminary  application  is  given  in 
Chapter  8  to  reduced  order  compensators. 

Because  of  the  fundamental  nature  of  this  work,  opportunities 
for  future  research  abound.  Perhaps  the  most  promising  is  the  unification 
of  near  unobservability,  the  closed  loop  subspaces  {1^}  and  system  decompo¬ 
sition  into  a  complete  compensator  design  theory.  The  GHR  framework  should 
unify  and  clarify  many  proposed  design  procedures.  The  almost  natural 
presence  of  time  scales  and  numerical  analysis  background  of  near  unobserv¬ 
ability  should  give  this  theory  breadth  and  depth.  In  addition  to  linear 
systems,  classes  of  nonlinear  systems  apparently  fit  in  this  framework. 

Thus,  the  GHR  is  a  natural  vehicle  for  generalizing  linear  system  concepts 
to  nonlinear  systems,  particularly  the  geometrical  aspects.  Finally,  we 
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note  that  the  numerical  analysis  of  linear  systems  has  become  a  topic  of 
Interest  In  its  own  right.  The  close  connection  of  chained  aggregation  and 
MCA  to  recent  numerical  studies  indicates  great  promise  for  the  ideas  of 
the  GHR  to  have  serious  computation  applications.  This  by  no  means  exhausts 
the  possibilities  for  future  research.  Many  other  directions  are  indicated 
in  the  text. 


APPENDIX 


TWO  AREA  TOWER  SYSTEM  MODEL 

A  two  area  power  system  with  each  area  containing  two  thermal  plants 
is  constructed  from  Caiovic  [96  ] .  The  system  is  modeled  by 


*  x  ■  Ax  +  Bu  +  Euj 
y  »  Cx 


(A.l) 


19-4  2  3 

where  xtR  ,  ufcR  ,  iu€R  and  y€R  .  The  state,  control  and  outputs  variables 
have  the  following  physical  meanings : 


*1’  *12 
V  *13 

x3’  X14 

*4’  x15 

x5’  X16 

x6’  X17 

x7’  X18 

x8’  x19 

x9 ’  X11 


valve  position  displacement  in  first  thermal  unit  of  area  1  and  2. 

power  output  displacement  of  HP  turbine  in  first  thermal  unit  of 
area  1  and  2. 

power  output  displacement  of  IP  turbine  in  first  thermal  unit  of 
area  1  and  2. 

power  output  displacement  of  LP  turbine  in  first  thermal  unit  of 
area  1  and  area  2. 

valve  position  displacement  in  second  thermal  unit  of  area  1  and 

2. 

power  output  displacement  of  HP  turbine  in  second  thermal  unit 
of  area  1  and  2. 

power  displacement  of  I?  turbine  in  second  thermal  unit  of  area 
1  and  2. 

power  displacement  of  LP  turbine  in  second  thermal  unit  of  area 
1  and  2. 

frequency  deviation  of  area  1  and  2. 

tie- line  power  flow  connecting  area  1  and  2. 

set  point  adjustment  of  first  thermal  unit  in  area  1  and  2. 


U2>  u4  **  set  point  adjustment  of  second  thermal  unit  in  area  1  and  2. 

Uj',  u.'^  ■  load  disturbance  of  area  1  and  2. 

■  frequency  deviation  of  area  1. 

“  tie- line  power  flow  of  area  1  and  2. 
y^  -  frequency  deviation  of  area  2. 

The  system  matrices  (A,B,E,C)  are  given  in  (A.2-A.3).  The  parameters  for  the 
first  area  (the  second  being  identical  in  structure)  are  as  follows: 

r  -  permanent  speed  droop 

T£  ■  time  constant  of  the  system  pilot  valve-servomotor  turbine  gates 

-  time  constant  of  the  turbine;  characterizes  the  delay  between  control 
valve  action  and  turbine  nozzle  action 
Tr  «  time  constant  characterizing  the  time  delay  in  the  HP  turbine 
rehearter  and  reheat  piping 

Tn  «  time  constant  characterizing  the  time  delay  in  the  IP  turbine  and 
crossover  piping 

cv  «  fraction  of  total  power  generated  by  HP  turbine 
cg  *  fraction  of  total  power  generated  by  IP  turbine 
e^  -  coefficient  chacterizing  the  influence  of  frequency  variation  on 
turbine  output  variation  (turbine  self-regulation) 
kt  ■  proportionality  factor  connecting  the  control  valves  position 
variation  and  HP  turbine  output  variation  in  the  steady  state 
(k£  for  IP  and  LP  turbine  are  very  close  to  1  since  power  variations 
of  these  turbines  in  the  steady  state  are  equal) 
e  »  participation  of  the  unit  in  total  system  oucout 
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“  load  turbine  and  system  self-regulation  coefficient  not  including 

the  participation  of  the  unit  under  consideration  (lead  characteristic) 

c  l 

e  *  ep  -  i“LerTr  2 

T  »  system  acceleration  time  constant  T  •  Tp  +  i-lei^G 

T  *  time  constant  due  to  the  mechanical  inertia  of  the  rotating  masses  in 
P 

the  load 

T_  *  unit  acceleration  time  constant,  T_  ■  2H. 

Gi  Gi  t 

H  *  inertia  time  constant. 

The  non-zero  elements  of  the  matrices  A,  B,  C,  and  E  expressed  as  functions 
of  the  physical  parameters  are: 
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The  parameters  h^2 ,  h^,  h23,  and  h^  describe  the  tie- line  dynamics  and 
interconnection  with  the  two  areas. 


AGGREGATED  MODEL 

The  following  values  are  obtained  for  the  first  area  (i=*l)  in 
equation  (5.4.8)  in  terms  of  the  parameters  in  (A. 2): 
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